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X.  IHIBODOCTIOfl 


1.  B1CKGBC0HD 

The  inception  cf  the  all  Volunteer  force  in  1S73 
provided  Army  manpower  planners  with  the  challenge  of 
attracting,  recruiting  and  retaining  high  quality 
personnel.  The  ever-increasing  technology  on  the  battle¬ 
field  coupled  with  budget  constraints  have  forced  manpower 
planners  to  search  for  an  efficient  alternative  to  sheer 
numbers.  The  soldier  of  today  must  be  able  to  operate  and 
maintain  highly  sophisticated  equipment.  In  addition,  the 
Army  manpower  planner  must  also  cope  with  a  decreasing 
supply  cf  18-21  year  olds.  In  fact,  this  cohort  is 
predicted  to  shrink  by  about  15  percent  by  1988  when 
compared  to  the  1979  cohort, and  by  about  25  per  cent  by  1994 
[fief.  1:  p.  2]. 

Of  course,  manpower  shortages  in  the  army  are  nothing 
new.  Past  shortages  have  been  both  quantitative  and  quali¬ 
tative;  the  shortages  historically  have  fluctuated  ever  the 
years  depending  on  the  intricate  balance  among  military 
requirements,  civilian  employment  and  wage  alternatives, 
[fief.  Is  P-1] 

Currently  Army  recruiters  have  eliminated  shortages. 
Through  an  extensive  advertising  campaign,  army  planners 
have  taken  maximum  advantage  of  current  economic  conditions: 
since  the  inception  cf  the  All  Volunteer  Force,  the  army  has 
met  its  objectives  in  numbers  of  enlistees  in  all  but  two 
years  (1177, FI79)  and  has  met  100  percent  of  objective  in 
the  last  foir  years  [fief.  2:  pp.6-7j. 

The  trends  alluded  to  above,  however,  indicate  that  such 
ease  in  manning  the  force  may  be  short-lived.  Army  manpower 
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planners  may  be  forced  to  recruit  "less-qualified”  soldiers 
just  to  Beet  Banning  requirements.  Bore  screening  will  be 
necessary  to  aeet  these  requirements  in  an  adeguate 
fashion. Cne  particular  screen  that  has  seen  widespread  use 
is  edacatioc  level  [Bef.  3:  p.342].  Future  recruiting  may 

not  result  in  the  high  percentage  of  High  School  Diploma 
Graduates  that  is  currently  enjoyed:  from  FY79  through  FY83 
an  average  of  sixty  percent  of  all  non-prior-service  enlis¬ 
tees  have  been  High-School  Diploma  Graduates  £Bef.  2: 
pp.  6-7]. 

The  FY85  Army  Eudget  calls  for  holding  active  end 
strength  to  FY  84  levels  £Bef.  4:  p.16].  This  in  turn 

leads  to  maintaining  an  80.6  per  cent  level  of  High  School 
Diplona  Graduate  content  [Ref.  4:  p.  16].  In  order  to  main¬ 
tain  this  level  and  maintain  FY  84  end  strength,  a  maximum 
of  12  per  cent  of  total  enlistees  may  be  non-high-schcol 
diploma  graduates  (HliSDG)  • 

In  light  of  increased  (due  to  inflation)  or  at  best 
constant  recruiting  costs,  army  manpower  planners  most 
necessarily  be  concerned  with  determining  exactly  what  level 
of  education  produces  the  best  recruiting  risk.  In  ether 
words,  if  Hon- High- School  Diploma  Graduates  and  Graduate 
Equivalent  Degree  enlistees  are  a  necessary  part  of  the 
force  structure*  what,  if  any,  are  the  associations  between 
education  level  and  "performance”?  This  research  effort 
will  provide  some  insight  to  this  question. 

Seme  commonly  accepted  measures  of  performance  currently 
in  use  by  army  manpower  planners  are 

1.  Attrition  (various  definitions  and  levels)  , 

2.  Skill  Qualification  Tests  scores, 

3.  Military  judicial  and  non-judicial  actions  or  lack 
thereof. 

£Bef.  5] 


14 


The  teri  attrition  itself  has  taken  on  nan?  different 
Meanings  in  recent  research.  1c  many  studies,  attrition  has 
keen  defined  as  "failure  to  complete  the  first  term  of 
service. "  [Bef .  6:  p.24]  In  this  study  total  length  of 
service  obtained  vill  be  used  as  a  measure  of  performance  in 
the  initial  analysis  and  the  above  definition  vill  be  used 
in  later  aore  detailed  analysis. 

Hanpover  policy  Makers  have  been  investigating  attrition 
since  tie  early  60* s  [Bef.  7:  p.1].  Such  research  has 
atteapted  to  predict  attrition  through  various  sorts  of 
aodels.  Across  the  Aray,  Havy,  and  Air  Force,  levs]  of 
education,  mental  ability  and  age  have  been  determined  as 
the  best  "pre-service"  predictor  variables  of  attrition 
[Bef •  7:  p.1]. 

The  cost  of  "assessing,  dressing  and  training"  a  typical 


soldier  has  been  estimated  at  approximately  $15000  [Bef.  8 
p.1 6].  This  initial  cost  resulted  in  a  total  cost  of 
$1,742,2C0,C00  in  reaching  FT  83  enlistment  goals,  based  on 
116,215  accessions  [fief.  9:  p.  i ].  Obviously  one  means  of 
Meeting  reguireaents  at  minimum  cost  is  to  reduce 
unnecessary  losses  of  money  through  premature  attrition. 

Attrition  studies  have  been,  for  the  most  part, based  on 
different  forms  of  regression  models,  particularly  linear 
and  logistic  models  using  both  individual  occupations  and 
occupational  groups.  [Bef.  10:  pp.  1-10] 

B.  POBfiCSB  OP  B2SBABCH  B7P0BZ 

The  purpose  of  this  thesis  vill  be  tvofold: 

1.  To  demonstrate  the  usefulness  of  Exploratory  Data 
Analysis  (BOA)  technigues  in  "preprocessing"  large 
vcluaes  of  data  generally  associated  vith  any 
aanpover  analysis.  This  thesis  vill  use  a  study  of 

attrition  of  O.S.  Aray  enlistees  as  the  vehicle  for 
• 
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this  demonstration.  The  dependent  variable  under 
investigation  is  specified  as  "total  active  federal 
service"  .  Ibis  phase  of  the  research  will  provide 
examples  as  tc  hov  EDA  techniques  can  assist  manpower 
analysts  and  decision  aakers  in  deteraining  problems 
in  the  data  under  analysis  and  in  variable  selecticn. 
(a  discussion  cf  exploratory  data  analysis  techniques 
is  found  at  Appendix  A) 

Opon  selecticn  of  suitable  predictor  variables  of 
attrition,  an  analysis  of  survival  functions  will  be 
utilized  to  provide  more  detailed  information. 
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1.  IBS  1111 

1.  ££££  CailSE t  Iii§  ££S£li£tiQD 

Is  stated,  the  data  used  to  fulfil  the  purposes  of 
this  thesis  vas  the  FY79  C0HCR1  file,  naintained  by  the 
Defense  flanpover  Data  Center  (CMDC)  at  Monterey,  California. 
This  COHORT  file  is  a  longitudinal  register  of  all  acces¬ 
sions  for  a  given  year,  updated  at  various  predeterained 
tines  so  as  to  allcv  for  tracking  of  performance  of  that 
cohort  in  subseguent  years.  The  FY79  cohort  under  investi¬ 
gation  vas  last  updated  in  September  1983.  The  file  depicts 
each  individual  through  69  variables  £Ref.  11].  FX79  vas 
arbitrarily  selected  as  a  representative  sample;  it  should 
be  noted  that  the  data  from  any  given  year  may  be  confounded 
by  political,  social,  and  economic  factors  vhich  arc  highly 
subjective  and  difficult  to  measure. 

2.  iisllilaacz  msgsiaallai  iM  Bata  8<?3as&Lflfl 

The  data  set  vas  reduced  based  on  a  reguest  for  an 
investigation  into  ncn-high-school-graduate  performance  from 
the  United  States  Army  Recruiting  Command  (USAREC) ,  Fort 
Sheridan,  Illinois.  This  reguest  and  subseguent  telephonic 
reguests  for  information  suggested  inve<  tigation  into  three 
military  occupational  skills  (BOS):  specifically  at  least 

one  BCS  from  each  of  the  major  subdivisions  of  the  Army, 
namely  Combat  Arms,  Combat  Support,  and  Combat  Service 
Support.  A  histogram  of  the  FT79  accessions  (vith  non  high 
school  diploma  graduate  status)  by  BOS  vas  developed 
(Appendix  B)  and  subsequently  the  BOS’s  rank  ordered  by 
numbers  accessed.  Eased  on  this  ranking.  Table  I  depicts 
the  BCS  *‘s  chosen  for  analysis. 
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TABLE  I 

Bilitary  Occupational  Skills  To  Be  Analyzed 


Ma jc t  Subgroup  of  Army 
Ccmtat  Aras 

Combat  Support 

Ccmtat  Service  Support 


MOS 

1 1B  Infantryman 
1 IX  Infantryman 
13B  Artilleryman 

64C  Motor  Transport 
Operator 

31H  Multichannel  Communica¬ 
tions  Operator 

76Y  Supply  Specialist 
94B  Food  Service 
Specialist 


In  addition,  the  data  set  vis  further  reduced  to 
only  non- prior- service  male  accessions,  based  again  on 
conversations  with  OSAHEC.  Of  course,  all  education  levels 
were  included  so  as  to  be  able  to  ultimately  compare  the 
effects  of  education  level.  A  data  reguest  vas  provided  to 
DMDC  for  the  data  described  above;  the  final  form  of  the 
data  vas  in  character  form,  stored  in  5  files  on  the  Mass 
Storage  System  of  the  Naval  Postgraduate  School  computer 
system. 

3.  Preparation  for  Exp loratorv  Data  Analysis 

Eased  on  the  above  reduction  of  the  data,  the  69 
available  variables  cf  interest  were  reduced  to  1 4  variables 
to  limit  the  scope  of  the  investigation  and  to  demonstrate 
the  use  of  the  Exploratory  Data  Analysis  technigues.  It 
should  be  noted  that  these  procedures  will  be  useful  on  any 
size  data  set  for  any  number  of  variables  subject  only  to 
the  limitations  of  the  storage  capacity  of  the  computer 
system  in  use.  Table  II  provides  a  listing  of  this  first 
selection  of  variables. 
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TABU  II 

Initial  Variables  of  Interest 


£SJEliJ22lm  laclaileg  i ezsis 

High  School  level  Obtained  13 

Current  Jay  Grade  31 

Barrtal  Status  (current)  2 

Muster  of  Dependents  (current)  9 

Character  of  Service  4 

Beenlistient  Code  4 

Age  at  Entry  15 

High  School  Level  at  Entry  13 

Sex  2 

Race  _3 

Ethnic  Cede  20 

Harital  Status/So. cf  Dependents  at  Entry  20 
AFCT  Group  (Hental  Category)  3 

mAafels  is 

Total  Active  federal  Service  Huaber  o i 


isisia 

Huaber  of  aonths 


Total  active  Federal  service  was  chosen  a a  the 
dependent  variable  at  this  stage  of  the  analysis  to  allow 
for  investigation  of  possible  associations  with  the  abeve 
candidate  predictor  variables  over  tiae  as  opposed  to  a 
"go-no-gc"  binary  representation  of  attrition.  This  depen¬ 
dent  variable  allows  the  decision  saber  to  initially  see  the 
effects  of  the  candidate  predictor  variables  on  different 
levels  of  attrition ,  whether  the  assesions  contracted  for 

three  lor  four  years  cf  initial  service. 

1 

|  These  variables  having  been  selected,  sis  pie  FOBTBAH 
and  All  prograss  (Appendix  C  and  D)  were  written  to  retrieve 
the  data  fres  sass  storage  into  an  interactive  environsent 
for  graphical  analysis. 


B.  BITHCDOIOGT 


Bxplcratory  data  analysis  techniques  are  to  be  utilized 
to  analyze  the  data  described  above.  A  draftsman's  display 
[Bef.  12:  pp.  136,145]  is  prepared  to  initially  process  the 
data.  Association  between  variables  of  interest  are  deter- 
■ined  as  well  as  any  possible  errors  in  the  data.  Upon 
necessary  refinement,  further  Draftsman's  displays  are 
utilized  to  select  possible  explanatory  variables.  Bcxplot 
analysis  is  performed  to  analyze  the  distribution  of  the 
levels,  of  the  candidate  explanatory  variables  and  their 
contribution  in  determining  length  of  service.  A  comparison 
of  the  statistics  of  each  distribution  is  utilized  to  deter¬ 
mine  relationships  among  the  various  levels  of  each  of  the 
candidate  explanatory  variables.  Confirmatory  analysis  in 
the  form  of  parametric  and  nonparanetric  hypothesis  testing 
is  presented  to  indicate  the  statistical  significance  of 
sample  comparisons. 

finally,  a  survivor  function  approach  is  utilized  to 
analyze  for  probabilistic  relationships.  Failure  times  and 
survival  times  are  identified  that  lead  to  calculations  of 
the  probability  of  attrition  and  reenlistment  for  both  the 
three  year  enlistees  (3T0)  and  the  four  year  enlistees  (410) 
from  the  FT79  C080BT  data. 
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III.  JU122SJ  Jim  kUUSll 

A.  I1ITIAL  DBAFTSBAl'S  PIS fill 

la  described  ia  Appendix  A,  the  draftsman's  display 
[Bef.  12:  pp.  136,145]  is  an  efficient  means  of  taking  a 
"first  glance"  at  a  fata  set.  In  addition,  the  use  of  1PL 
as  a  programming  language  fcr  this  analysis  allows  fcr  rapid 
manipulation  of  large  arrays  in  a  user  friendly  fashion.  1 
ten  percent  sample  cf  the  data  set  consisting  of  the  vari¬ 
ables  listed  in  Table  II  above  was  run  through  the 
"draftsman"  program  [Bef.  13].  Sampling  was  performed  by 
reading  every  tenth  record  of  the  data  set  provided  by  CBIC. 
The  file  was  prepared  in  Social  Security  Account  Humber 
order  so  10  percent  sampling  ensured  that  a  country-wide 
sample  was  created.  Also,  the  file  being  longitudinal,  any 
length-biased  sampling  problems  were  avoided  [Bef.  14: 
P«13]«  The  output  produced  was  a  14  by  14  matrix  cf  two 
dimensional  scatter  plots.  A  segmented  copy  of  the  display 
is  found  at  Appendix  £•  Befer  to  this  Appendix  for  the 
following  discussion.  Hote  that  the  data  has  been 
"jittered"  to  reduce  cverlap  of  the  data  or  data  points  with 
the  same  discrete  values  [Bef.  12:  p.21].  (note: coding  of 

the  variables  is  defined  in  [Bef.  11]  ) 

First  an  overall  view  of  the  entire  display  is  very 
useful  tc  an  analyst  in  several  ways: 

1.  Categorical  data  is  rapidly  identified  by  the 
"blocking  effect"  seen  in  most  of  the  displayed  vari¬ 
ables,  e.g.,  "Harital  Status"  vs.  "BOS."  This  aspect 
of  the  display  is  critical  in  allowing  the  analyst  to 
"see"  the  data  and  immediately  determine  where  dummy 
variables,  for  example,  may  be  necessary. 
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2.  Also  coding  of  the  variables  is  displayed  as  in  "Hs 

at  entry  "  vs.  "Total  Service"  One  observer  that 
the  scale  of  education  ranges  froa  0  to  13,  with  the 
sajozity  of  the  data  grouped  at  4  and  13,  which 
corresponds  to  2  years  of  high  school  and  a  Graduate 
Egui valency  Diploma  (GED)  ,  respectively. 

Fortunately,  in  this  analysis,  the  file  description 
containing  the  coding  schedule  was  available;  one  can 
envision  the  csefulness  of  the  graphical  technique  in 
"encoding"  data  sets  that  may  not  have  an  accompa¬ 
nying  description.  The  analyst  could  then  recode  as 
necessary  with  many  of  the  commonly -used  file 
mazagement  systems  available. 

3.  Errors  in  the  data  may  be  identified  in  a  rapid  and 
efficient  manner.  Again  referring  to  "HS  at  entry" 
vs.  "Total  Service",  the  majority  of  the  data  is 
shown  to  be  2  year  high  school  level  (4)  and  GED 
(13).  How  the  official  request  for  data  to  DHDCwas 
for  all  education  levels.  Because  of  a  simple  misun¬ 
derstanding  and  a  misplaced  operand  in  the  code  jthat 
extracted  the  data  froa  the  master  cohort  file,  only 
HBSG  data  were  provided.  The  use  of  the  display, 
then  allowed  for  the  prevention  of  the  costly  mistake 
the  unsuspecting  analyst  may  have  made  in  developing 
a  model  with  ezroneous  data.  This  error-preventing 
aspect  of  this  procedure  manifested  itself  in 
subsequent  displays  as  will  be  discussed  later. 

Q.  Although  further  analysis  was  not  performed,  the 
display  allows  the  analyst  to  determine 
aulticollinearity/interaction  effects  of  the  concomi¬ 
tant  variables.  For  example,  "Age  at  entry" 
vs. "Humber  of  dependents"  plot  may  provide  tho 
impetus  for  further  study,  if  either  of  the  variables 
hed  a  visual  effect  on,  say,  total  service. 
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The  use  of  the  entire  display  is  simple  and  "intuitive". 
It  allows  an  analyst  to  bridge  the  gap,  at  least  in  seme 
fashion,  between  the  quantitative  world  of  the  analyst  and 
t»*  "real"  world  of  the  decision  maker  through  the  power  of 
the  brain's  visual  correlation  abilities. 

In  the  problem  at  hand,  that  is,  the  effect  of  various 
personal  characteristics  on  performance  (in  the  fora  of 
attriticn  or  nore  generally,  total  service),  analysis  of  the 
first  ccluan  of  the  display  is  most  revealing.  Again,  this 
data  set  has  been  discovered  to  only  contain  NHSG  and  GED; 
the  entire  spectrua  of  education  levels  will  be  analyzed  in 
a  subsequent  draftsman's  display.  .The  first  column  of  the 
display  depicts  scatter  plots  of  all  independent  variables 
versus  total  Federal  service.  To  aid  in  the  discussion, 
seven  of  tbe  plots  have  been  reproduced  in  Figure  3. 1  and 
the  regaining  seven  in  Figure  3.2  below. 

Viewing  both  figures,  no  rapidly  discernible  or 
"glaring"  associations  are  evident.  This  is  largely  a 
result  of  tbe  sheer  size  of  the  data  set. 

Bovevex,  much  useful  inforaation  is  available: 

zisiics  1-J#  iss. sna  lalai  s§mqg: 

•  This  figure  indicates  that  the  largest  number  of 
assesions  were  BOS  1 1B  (a  "1"  in  the  DBDC  coding) 
followed  by  13B  (2  on  coding  scale).  No  Ill's  are 
discernible  because  this  entry  level  "basic  foot 
soldier"  BOS  was  not  created  until  1980.  BOS  €4C 
seeas  to  have  a  distinct  break  in  length  of 
service:  this  break  indicated  that  perhaps  further 
analysis  is  needed.  One  possible  explanation  nay 
be  that  a  portion  of  the  accessions  entered  this 
BOS  only  for  the  training,  and  subsequently  left 
the  service  at  first  opportunity  through  an  adain- 
istrative  discharge.  BOS  76 1  (6  on  DBDC  scale) 

.  seeas  to  be  the  aost  successful  in  teras  of 


Figure  3. t  First  Colusa  of  Draftsman* s  Display 

performance  (i.e. ,  cospletion  of  at  least  36  months 
of  service)  . 

liaaia  2-1#  kin  Versus  Total  Service: 

•  This  variable  coabination  is  coded  from  1  through 
8,  with  1  being  a  Category  7.  This  lowest  category 
corresponds  to  an  AFQI  total  score  of  9  or  less.  A 


more 


slight  tread  is  evident:  as  AFQT  increases, 

data  points  are  found  in  the  higher  total  service 
range.  This  graphical  analysis  agrees  with  the 
literature  presented  in  the  introduction  of  this 
thesis.  This  also  indicates  that  most  of  the 
accessions  considered  here  ( NHSG  and  GF.D)  are  found 
in  categories  3,4, 5,6,  which  corresponds  tc  mental 
categories  IVB,IVA ,IIIB,  and  IIIA,  respectively. 
This  leads  to  the  conclusion,  as  expected,  that 
NHSDG  accession  performance  on  the  AFQT  is  in 
consonance  with  education  level. 

3.  Figure  3.1,  Marital  Status  Versus  Total  Service: 

•  This  plot  indicates  that  most  NHSDG  were  single 
with  no  dependents  (1 0  on  the  DMDC  coding  scheme). 
Again  the  mass  of  data  points  requires  further 
subsequent  analysis.  The  upper  grouping  depicts  a 
slight  increasing  trend  as  accessions  differ  by 
number  of  dependents  (20=married/no  deps., 
21=married/1  dep.,  etc.).  This,  along  with  ether 
aspects  will  be  analyzed  in  more  detail  in  section 
B,  this  chapter. 

4.  Figure  3.J,  Ethnic  Code  Versus  Total  Service: 

•  This  plot  indicates  that  a  great  preponderance  of 
the  accessions  were  "other"  which  corresponds  to 
Caucasian  in  the  DMDC  coding.  Puerto  Ricans  (cede 
4)  indicate  increased  total  service,  leading  to  the 
conclusion  that  race  is  an  important  predictor  of 
service  and  attrition. 

5.  Figure  3- .1,  Bace  Versus  Total  Service: 

•  This  plot  reinforces  that  of  the  ethnic  code  plot. 
Most  assesiens  were  white,  as  indicated  by  the  mass 
of  data  at  code  1.  There  is  a  slight  increase  of 
service  indicated  in  the  category  2  corresponding 
to  blacks,  and  perhaps  even  a  greater  massing  in 
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the  50*60  month  area  of  category  3,  which  eerie* 
spends  to  race  "other".  Again  race  seems  to  be  a 
predictor  of  service  or  attrition. 

IlSJlIS  l.J,  §§2  Versus  lotai  Service; 

«  Although  the  data  reguested  from  DM DC  was  for  male 
accessions  (coded  1) ,  at  least  a  few  data  feints 
are  evident  in  the  category  coded  "2",  ccrres* 
ponding  to  sex  "female".  Subsequent  analysis 
through  an  API  program  written  to  "scan"  the  data 
and  count  the  frequency  of  certain  data  elements 
(Appendix  3)  indicated  that  in  excess  of  200  female 
HHSDG  persons  were  accessed  and  recorded  in  this 
file.  Again  the  draftsman's  display  and  more  gener* 
ally  the  exploratory  data  analysis  approach 
indicated  erroneous  data,  perhaps  preventing  faulty 
analysis. 

Zisscs  Salsi  isisas  I&tai 

Sfiiiss: 

•  Previously  discussed  above.  This  variable  has  seen 
widespread  use  as  as  a  screening  device  in 
recruiting  £Eef.  7:  p.  1  ]. 

Figure  £-2,  4<=e  at  Entry  Versus  Total  Service; 

•  This  variable  combination  indicates  a  wide  range  of 
values  because  of  the  more  "continuous"  nature  of 
the  age  variable.  Although  most  HHSDG  accessions 
were  in  the  18*20  year  category,  this  plot  indi* 
cates  a  slight  increase  in  total  service  as  age 
increases  to  about  25.  Then  the  plot  may  indicate 
that  older  accessions  do  not  fare  as  well  in  the 
measure  of  performance  chosen.  This  agrees  with  the 
literature;  further  investigation  is  deemed 
necessary  by  the  display. 
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rig ore  3.2  First  Column  Continued 

ZlSSIS  2-2#  toalAsUsal  SQ.de  Versus  Total,  Service: 

•  This  plot  provides  information  that  premature 
attritors  and  successful  first-  tour  completers  nay 
receive  the  same  reenlistment  code.  &  "0"  indi¬ 

cates  unknown  or  no  code,  a  "1n  eligibility  to 
reenlist,  a  N2"  that  a  local  bar  to  reenlistsent 


has  bees  affixed,  a  "3"  that  a  DA-level  EAB  has 
bees  applied,  and  a  "4"  that  the  soldier  is  totally 
ineligible.  The  DHDC  file  description  only 
provided.  2  cedes:  again  the  display  has  indicated 
erroneous  data  or  perhaps  changed  codes  subsequent 
to  latest  file  update.  This  display  prompted  a 
telephonic  clarification  to  DHBC  that  nay  not  have 
been  made  in  the  absence  of  exploratory  data  anal¬ 
ysis.  Since  both  "failures"  and  "successes"  can 
receive  a  cede  of  "eligible",  this  variable  is  not 
useful  as  a  predictor  of  attrition. 

10.  Figure  3. 2,  Character  o£  Service  Versus  Total 

§€£Zige: 

•  This  variable  combination,  like  reenlistment  cede 
above,  is  ar  indicator  of  "after  the  fact"  perform¬ 
ance,  either  premature  attrition  *  or  successful 
first-term  completion.  Again,  a  "0"  indicates 
unknown  or  uncoded  data,  a  "1"  an  honorable 

discharge,  "2"  under  honorable  conditions,  a  "3" 
other  than  honorable  conditions,  and  a  "4"  a 

dishonorable  discharge.  A  massing  of  the  data  in 

the  0-20  month  area  of  honorable  discharges  indi¬ 
cates  that  premature  attritors  were  granted  such  a 
discharge  through  perhaps  the  Trainee  Discharge 
Program  or  Expeditious  Discharge  Program.  Thus 
this  variable  combination  is  not  a  good  predictor 
of  performance  since  discharge  award  in  seme  cases 
is  based  on  the  local  commander vs  decision.  Quite 
often  the  honorable  discharge  may  have  been  granted 
to  speed  the  discharge  of  a  substandard  soldier; 
Under  Honorable  Conditions  discharges, Other  Than 
Honorable  discharges  and  Dishonorable  discharges 
require  more  "red  tape"  and  administrative  delays. 


11.  Fioare  Humber  Dependents  Versus  Total 

SSL liSS* 

•  This  plot  is  an  updated  data  entry  that  allows  for 
the  tracking  of  the  addition  of  dependents  through 
the  serviceman's  tenure.  Thi  plot  indicates  a 
majority  of  the  service  people  considered  through 
September  1983  still  had  zero  dependents  (coded 
"1").  As  number  of  dependents  increase,  total 
service  seems  to  increase,  although  there  is  a 
proportion  cf  data  points  with  more  dependents  that 
failed  to  cciplete  36  months  of  service.  This  plot 
indicates  that  with  further  analysis,  number  of 
dependents  may  be  a  predictor  of  total  service  and 
attrition. 

12.  giasts  i fliiilai,  Slaiaa  isssus  Satai  seoisg 

•  This  variable  is  an  updated  variable  indicating 
current  marital  status  of  the  service  member,  a  "2" 

« 

for  married  and  a  "1"  for  other  £Bef.  11].  The 
plot  indicates  a  majority  of  the  service  members 
are  still  single.  Harried  service  members  seem  to 

demonstrate  an  increase  in  total  service,  indi- 

! 

eating  that  this  variable  is  a  candidate  predictor 
variable  regaining  further  analysis.  Again  ,  this 
agrees  with  the  literature. 

13.  gjaais  1-2/Eai  siafla  isibas  service: 

•  Although  pay  grade  is  not  really  a  relevant  means 
of  predicting  performance  of  accessions,  this  plot 
gives  a  goed  indication  of  the  "intuitiveness”  of 
the  graphical  depiction  of  the  data.  This  plot 
indicates  an  "ideal"  upward  trend:  as  months  of 
total  service  increase,  the  mass  of  data  moves 
upward  and  to  the  right.  High  performers  are  those 
that  receive  promotions  earlier  than  the  mass  in 

.that  particular  pay  grade,  for  example,  grade  E5 
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indicated  bj  a  5  on  the  vertical  axis  of  the  plot 
shows  that  the  majority  of  the  service  m embers  were 
promoted  in  the  46-60  month  time  frame,  which  is 
the  norm.  However,  observe  the  points  in  the  38-40 
month  area,  indicating  waiver  or  early  promotion. 
From  this  plot,  one  can  ascertain  some  idea  of  the 
number  of  achievers  in  this  cohort.  Along  the  same 
lines,  the  poor  performers  are  evident  in,  for 
example,  the  El  grade  in  the  50  month  time  frame. 
The  entire  plot  seems  to  indicate  that  there  is  the 
expected  upward  mobility  of  the  average  soldier. 

14.  Figure  .3.2,  High  School  Obtained  Versus  Total 
Service: 

•  Although  massive  data  size  obscures  the  plot,  this 
plot  does  indicate  the  some  of  the  HHSG  accessions 
have  completed  their  6ED  requirements  (aN6n  cn  the 
scale)  during  this  time  frame.  Of  these  that  have 
completed,  a  slightly  increasing  trend  is  evident, 
again  reinfcrcirg  the  literature  that  education 
level  is  a  suitable  independent  or  predictor 
variable  on  attrition. 

In  summary  this  first  draftsman's  display  has  demon¬ 
strated  that  the  graphical  (EDA)  procedures  are  critical  in 
identifying  erroneous  data,  determining  variables  of 
interest,  and  identifying  nulticollinearity/i.  ^.erection 
effects.  How  a  more  refined  version,  with  an  even  further 
reduced  list  of  candidate  variables  and  in  some  cases,  with 
variables  that  have  been  recoded  so  as  to  be  more  intui¬ 
tively  appealing  was  produced.  This  display  is  in  segmented 
form  at  Appendix  F.  This  version  of  the  display  was  devel¬ 
oped  from  the  more  general  data  set  of  FY79  accessions,  this 
time  including  all  education  levels. 
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B.  BIT1S20  DHAFTSHAl'S  DISPIAI 

Ihe  initial  exploratory  analysis  above  revealed  that 

1.  the  data  cn  hand  was  not  suitable  in  that  it  did  not 
contain  High  School  Diploma  Graduates  (HSD 6)  for  use 
in  ccaparing  any  effects  of  education  on  performance; 

2.  tie  list  of  variables  under  investigation  could  be 
fcrther  reduced. 

The  purpcse  of  this  second  iteration  of  draftsman's  analysis 
vas  tc  serve  as  a  final  check  on  the  data  and  to  determine 
any  other  relevant  information  from  the  data  prior  to  a  more 
detailed  investigaticn  utilizing  other  methods. 


TABLE  III 

Seduced  Variable  List  for  Further  Analysis 


Dependent  Va&afeig 


Total  Service 


iAElflttatgiy  lacjablg 


Hilitary  Occupational  Skill  (HOS) 
Harital  Statu s/Huaier  of  Dependents 
Bace 
Sex 

Level  of  Bducation  at  Entry 
Age  at  Bntry 
dental  Category 


S sa seal  igcfaaanss  Ifti4satp.i§ 

Be  enlist  lent  Code. 

Character  of  Service 


The  data  shortfall  in  education  level  alluded  to  above 
was  solved  in  the  submission  of  a  reguest  to  DHDC  for  a  acre 
complete  data  set.  This  second  data  set  vas  received  and 
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again  stored  in  5  files  in  the  Hass  Storage  System  of  the 
HPS  Computer  System.  The  FY79  COHOBT  file  consisted  of 
.  30778  records  of  data.  Files  for  the  FY80-FY83  COHOE1S  were 
also  acguired  for  later  model  validation  and  subsequent 
research.  It  should  he  noted  that  the  target  data  reguested 
was  tc  he  non- prior-* service  (HPS)  male  enlistees.  The  vari¬ 
ables  under  investigation  for  this  phase  are  listed  in  Table 
III  •  The  last  two  variables  in  the  table  were,  as  previ¬ 
ously  stated,  not  to  be  considered  as  predictors  hut  as  a 
means  of  assessing  general  performance  of  the  enlistees. 

Again,  FOBTBAN  and  APL  programs  (Appendices  C,E)  were 
utilised  to  retrieve  the  data  from  mass  storage  and  to 
manipulate  it  into  form  for  interactive  analysis.  A  ten 
percent  sample  of  the  data  was  taken  for  analysis  (3C78 
records) •  The  data  was  again  jittered  to  reduce  overlap. 

An  overall  view  of  the  draftsman's  display  (Appendix  E  ) 
demonstrated  the  following: 

1.  The  new  data  set  is  mostly  categorical  sis  expected. 

2.  All  levels  of  education  have  been  included  as  demon¬ 
strated  by  the  "HS  at  Entry  vs.  Total  Active  Service” 
plot. 

3.  The  total  service  scale  on  all  plots  extends  to  160 
months,  indicating  that  at  least  some  prior  service 
enlistees  have  been  erroneously  included  in  the  data. 

4.  Seme  female  erlistees  have  been  included  as  indicated 
kj  the  "sex  vs.  Total  Service”  plot,  again 
demonstrating  erroneous  data. 

Best  of  the  disccssion  below  centers  on  the  first  column 
of  the  display;  hence  this  column  has  been  reproduced  in 
Figure  3.3  anl  Figure  3.4  Viewing  both  figures  simultane¬ 
ously,  the  massing  of  the  data  points  in  heavily  concen¬ 
trated  "blocks"  demonstrates  the  large  number  of  data  points 
in  the  sample.  The  dimensionality  of  the  problem  is  graphi¬ 
cally  e.vident.  Seme  specific  information  that  can  be 
gleaned  frern  this  display  follows. 
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Figure  3.3  First  Colons  of  Revised  Draftsaan's  Display 

1.  fjgflrs  ClUtall  fi££Jl£aijt2Mi  Skill  IS-  l2l§l 

Service: 

•  The  1 1B  and  13B  occupational  skills  (1  and  3  in  the 

DHDC  coding)  appear  to  have  attracted  the  aost 
enlistees.  Ho  discernible  trend  or  association  is 
apparent  in  the  length  of  service  for  these  two 
coabat  eras  occupational  skills.  Skill  64C  (coded 
4)  deaonstxates  the  "break"  in  length  of  service 
just  as  in  the  non -high- school-graduate  data  in  the 
previous  section.  Occupational  skill  761,  1  (6  on 
the  DBDC  scale)  as  before,  appeared  aost  successful 
with  less  soldiers  attriting  within  36  acntns  of 
service,  or  the  noxaal  first  tour  length.  1 

2.  fiaais  1-3#  issial  Salssso  is-  &£&&&  of  service: 

•  This  variable  corresponds  to  "1FQT  group"  in'  the 
.original  display,  and  has  been  renaaed  based  cn  the 
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information  seen  in  the  first  draftsman’s  display. 
This  variable  has  also  been  recoded  to  reflect  a  1 
for  mental  categories  5  and  4c,  a  2  for  mental 
categories  4h  and  4a,  a  3  for  categories  3b  and  3a, 
and  a  4  for  categories  2  and  1.  This  plot  demon¬ 
strates  the  recruiting  policy  of  targeting  the 
higher  mental  categories  by  the  massing  of  the  data 
in  those  respective  areas. 

&££££  1-2/  flasiiai  gtasas  Min  i§*  Iota! 

ssuiss: 

•  This  variable  is  coded  vith  a  10  for  single  with  no 
dependents  through  19  for  single  with  8  dependents, 
and  then  a  20  for  married,  no  dependents,  through 
29,  for  married  8  dependents.  Most  enlistees  in 
this  data  file  mere  single  with  no  dependents.  The 
married  enlistees  (20  and  higher  on  the  DMDC  scale) 
indicated  a  slightly  increased  total  service, 
particularly  as  number  of  dependents  increased 
(21,22  and  23  on  the  scale) 

liSSIS  2-2#  £J£S  is.  2s ill  Se£vjgg: 

•  The  addition  of  the  high-school-diplona  graduates 
has  not  affected  the  pattern  that  was  evident  in 
the  original  draftsman’s  display:  a  preponderance 
of  whites  (ceded  1)  followed  by  blacks  (ceded  2), 
and  others  (coded  3} is  still  evident.  The  NctherM 
category  still  demonstrates  increasing  length  of 
service . 

liaass  2*2#  isi  23- 

•  As  previously  stated,  this  plot  demonstrates  that 
the  supposed  all  male  non-prior-service  file  has 
included  in  it  a  number  of  female  enlistees.  Also 
note  that  the  massing  of  the  data  indicates  that  a 
large  proportion  of  these  females  attrite  vith  less 
than  twenty  months  service. 
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Figure  3.4  First  Coluan,  Continued 

6.  Uaais  i •a,  fliai  Ssksol  ai  aoiti  is-  latal  Sanias; 

•  The  inclusion  of  nil  levels  of  education  can  ncv  be 
verified.  The  plot  has  been  recoded  utilizing  an 
API  pro  gras  (See  Appendix  0)  as  shown  in  Table  17 

below:  The  aassing  of  the  data  at  position  5  indi¬ 
cates  that  lost  enlistees  in  this  sample  were  high 
school  diplcaa  graduates.  Also,  enlistees  with  two 
to  four  years  high  school  outnuaber  those  with 
equivalency  status.  Ho  discernible  differences  can 
be  observed  in  the  eguivalency  certificate  holders 
over  the  other  levels  of  education  due  to  the 
■assing  of  the  data.  Hote  the  distinct  treat, 
though,  in  the  GEO  length  of  service.  A  grouping 
is  evident  for  zero  to  twenty  aonths  of  service. 


11BLE  IT 

Becoding  of  Education  Levels  foe  Draftsaan's 

Display 

1.. ...  Op  to  1  year  of  high  school 

2  . 2  years  high  school 

3  .  3-4  years  high  school,  no  diplcaa 

4.. ...  Graduate  Equivalency  Diploma 

5..  ....High  School  Diploma  Graduate 

6.. .  ...It  least  1  year  college  and  higher 


and  another  for  approziaately  thirty  tc  forty 
aonths.  This  aay  he  partially  explained  by  viewing 
age  at  entry  versus  high  school  at  entry  along  the 
sane  row  as  high  school  versus  total  service  in 
Figure  3.5.  This  saae  break  is  evident:  the 


Figure  3.5  Bov  of  Display  for  Cross  Coaparison 

aajority  of  the  enlistees  with  the  equivalency 
certificate  (GED)  are  20  years  old  and  belov  with  a 
break  belov  20.  Perhaps  this  break  in  age  vlen 
vieved  vith  the  break  in  service  indicate  that  the 
younger  GEE  enlistee  is  not  as  successful  in 
ccapleting  service,  just  as  he  vas  in  not 
coapleting  high  school. 


USUIS  1.4,  ice  at  is.  2otal  jegyice: 

•  The  massing  cf  the  data  indicates  that  most  enlis¬ 
tees  fall  in  the  17-21  category,  is  age  increases, 
tctal  service  again  seems  to  be  divided  into 
distinct  branches.  These  branches  coaid  possibly 
be  related  tc  marital  statas,  number  of  dependents, 
or  education  level,  where  breaks  of  this  natnre 
were  also  evident.  See  the  entire  display  to 
compare  each  of  the  above  mentioned  variable  combi¬ 
nations  versus  length  of  service  and  against  ether 
possible  ccmbinations  of  other  variables  for 
possible  insights. 

Siaacs  1-3#  SiSiliigSagSi  £°££(nE£a£"  code)  vs.  Sotaj 

§££2i£e: 

•  This  plot  shews  that  a  surprising  number  of  enlis¬ 
tees,  both  premature  "leavers'*  and  successful 
(i.e. ,  with  more  than  36  months  service)  "stayers", 
have  an  uncoded  0  reenlistment  code.  This  probably 
indicates  that  the  record  has  not  been  posted  with 
this  inforiation.  Code  1,  corresponding  to 
"eligible  fer  reenlistment",  is  massed  after  36 
months,  indicating  that  completion  of  at  least  the 
first  term  is  a  requirement  for  reenlistment  eligi¬ 
bility.  Cede  2,  a  local  bar  to  reenlistment,  is 
also  massed  around  36  to  40  months  of  service, 
indicating  that  the  decision  to  allow  reenlistment 
is  often  reserved  for  the  end  of  a  soldier’s  term 
of  service.  It  should  be  noted  that  the  bar  to 
reenlistment  can  be  issued  by  the  local  ccmmander 
at  any  time  deemed  necessary,  yet  it  appears  that 
exercising  cf  this  powerful  option  is  not  being 
done.  Code  3,  the  Department  of  the  Army  bar  to 
reenlistment,  indicates  a  uniform  massing  of  the 
data.  This  agrees  with  Department  of  the  Army 


pciicy  to  autoaatically  bar  soldiers  that  have 
received  certain  recognition  as  substandard  through 
judicial#  ncn- judicial,  and  adainistrative  actions. 
Cross  coapazison  with  the  next  column  cf  the 
display  (see  Figure  3.6)  reinforces  this  idea: 
This  plot  is  reenlistaent  code  versus  character  of 


Figure  3 


.6  S 


egaent  for  Cross-Coaparison 


service  awarded.  \  Beenlistaent  code  3  has  the 
aajority  of  the  character  of  service  codes  2  and  3 
corresponding  to  "under  honorable  conditions"  and 
"other  than  honorable  "  discharges  respectively# 
both  cf  which  are  considered  substandard. 
Siailarly#  code  4  in  reenlistaent  signifies 


ineligibility  for  reenlistment,  which  is  noticeably 
less  of  the  data,  contains  the  worst  character  of 
service  code,  a  dishonorable  discharge  indicat  i  by 
a  4  on  the  character  of  service  axis. 

9.  Fig  ere  3.4,  Character  of  Service  vs.  lenc,  Lh  of 
Service: 

•  This  plot  indicates  that  as  length  of  service 
increases  tc  38-40  months,  character  of  service 
code  decreases.  The  DHDC  coding  [Eef.  11]  lists  the 
coding  as  1  for  an  honorable  discharge,  so  one  can 
infer  that  "stayers*1  generally  receive  honorable 
discharges.  Those  unsuccessful  soldiers  with  less 
than  36  months  service  receive  the  majority  of  the 
"under  honorable  conditions",  "other  than  honor¬ 
able"  and  c,dishonoratle"  discharges,  coded  2, 3, and 
4  respectively. 

This  revised  draftsman's  display  analysis  has;  provided 
some  insights  into  those  personal  characteristics  affecting 
length  of  service  and  hence  attrition.  All  seven  possible 
explanatory  variables  under  consideration  have  demonstrated 
at  least  some  effects  on  length  of  service.  The  effects  of 
education  level,  mental  category,  marital  status  and  number 
of  dependents,  and  age  have  at  least  initially  seen  to  be 
most  profound. 

These  displays  have  served  to  provide  an  initial  graph¬ 
ical  view  cf  the  data  that  is  both  intuitively  pleasing  and 
simple.  In  addition  to  identifying  possible  erroneous  data 
entries  and  any  peculiar  coding  of  the  variables,  a  major 
result  of  the  analysis  has  been  a  reduction  in  the  dimen¬ 
sionality  of  the  data  itself.  The  displays  also  aid  in  a 
general  familiarization  with  the  data  under  investigation. 


39 


C.  BCXflOl  IB AL ISIS 

The  boxplot  vill  be  utilized  to  demonstrate  another 
Exploratory  Data  Analysis  technigue  that  provides  a  powerful 
■eans  of  obtaining  acre  information  about  a  data  set.  (See 
Appendix  A  for  a  discussion  of  boxplots  in  general.) 

1.  Education  Level  Versus  Length  of  Service 

In  order  to  demonstrate  the  effectiveness  and  power 
of  bcxplcts,  one  of  the  six  candidate  explanatory  vari¬ 
ables,  high  school  education  level  at  entry,  will  be 
investigated  in  detail.  As  stated  in  the  introduction,  the 
literature  has  pointed  out  that  education  level  at  tine  of 
entry  has  been  a  generally  accepted  predictor  variable  in 
attrition  analyses  during  the  last  ten  years.  Therefore  a 
boxplot  analysis  utilizing  the  category  subpopulation  anal- 
ysis  prograa  in  the  IBB  experimental  GBAFSTAT  package  is 
presented  in  the  following  discussion.  Length  of  service, 
the  chosen  dependent  variable  for  this  portion  of  the  study, 
is  plcttcd  versus  education  level  at  entry. 

The  left  panel  of  Figure  3.7  is  a  depiction  of  a  ten 
per  cent  sample  of  F179  enlistees  as  of  September  1983.  As 
previously  stated,  this  data  set  is  a  smaller  subset 
consisting  of  seven  military  occupational  skills  from  the 
entire  IT79  COHOST  file  from  DBDC.  The  GBAFSTAT  input 
screen  for  this  prograa  allows  the  analyst  to  subdivide  this 
batch  of  data  into  its  seven  component  occupational  skills 
through  the  input  of  a  simple  "category"  vector.  (See 
Appendix  G  for  a  depiction  of  this  input  screen.)  The  y 
axis  is  length  of  total  active  service  in  months,  while  the 
x  axis  indicates  education  level  at  time  of  entry  into  the 
army.  Ihe  son  high  school  diploma  graduate  level  is  indi- 
cated  by  "HESDc"  and  indicates  three  to  four  years  of  high 
school  without  a  diploma.  Education  level  decreases  toward 
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LENGTH  OF  SERVICE  VS.  EDUCATION  LEVEL  LENGTH  OF  SERVICE  VS.  EDUCATION  LEVEL 

NPS  ONLY 


the  origin,  with  a  4  indicating  2  years  of  high  schccl,  a  3 
indicating  1  year  of  high  school,  a  2  indicating  2  jears  of 
junior  high  school,  ard  a  1  indicating  1  year  of  junior  high 
schocl.  1c  the  right  of  the  position  narked  "HHSDG",  educa¬ 
tion  level  increases  until  the  position  narked  "GED"  which 
indicates  a  graduate  equivalency  "diploaa"  or  certificate. 
1  6  indicates  high  school  diplona  graduate,  a  7  indicates  1 
year  cf  college,  8  fcr  2  years  of  college,  9  indicates  3  to 
4  years  college  hut  nc  degree,  and  a  10  indicates  a  college 
degree.  Eleven  and  12  indicate  a  nasters  and  doctorate 
degree,  respectively. 

Viewing  the  entire  display  in  the  left  panel,  the 
hoxplct  provides  a  graphical  statistical  sunnary  of  the 
distribution  of  each  of  the  subcategories  (i.e.,  the  levels 
of  education)  in  a  fora  for  easy  coaparison.  A  table  of 
values  is  also  presented  beneath  the  display.  The  aean  of 
each  suhcategory  is  depicted  by  the  dot  (with  lines 
connecting  the  subcategories).  The  aedian  of  each  suhcate¬ 
gory  is  depicted  by  the  other  dot  in  the  body  of  the  bcz. 
Adjacent  values  and  their  associated  "whiskers”,  or  the 
lines  drawn  froa  the  body  of  the  box  to  the  adjacent  values 
depict  tie  tails  of  the  distribution  of  each  suhcategory. 
Outliers  are  depicted  by  heavy  dots  and  are  defined  as  these 
values  greater  than  1.5  tines  the  int..r  guar  tile  range  of  the 
distribution.  The  lean  of  the  entire  display  across  all 
subcategories  is  depicted  by  the  arrowhead  on  the  y  axis. 

It  is  iaaediately  apparent  that  this  subpopulation 
contains  sone  enlistees  with  prior  service,  indicated  by  the 
outlier  values  in  the  left  panel  of  figure  3.7  that  have 
aore  than  60  aonths  cf  service.  (If  an  enlistee  entered  the 
Arny  on  30  September  1978,  aax  length  of  service  without 
prior  service  is  60  senths  through  30  Septeaber  1983.)  The 
data  reguested  froa  CHOC  was  to  be  non-prior  service  (NES) 
only;  again  the  iaportant  error-indication  guality  of  this 
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technique  is  readily  apparent.  The  non-prior  service  enlis¬ 
tees  can  he  easily  extracted  £roo  the  data  set  in  the 
interactive  node  of  this  package;  by  the  insertion  of  a 
simple  truth  statement  (e.g.,  length  of  service  £60  months) 
the  data  set  is  reduced  to  the  desired  set.  (See  Appendix  6 
for  the  depiction  of  the  input  screen;  selection  is  entered 
in  the  "selection"  area.)  This  feature  is  also  most  useful 
in  the  comparison  of  rertain  n+her  characteristics  of  the 
data  as  will  be  demonstrated. 

The  ability  to  interactively  subdivide  the  data 
according  to  other  variables  of  interest  is  an  extremely 
powerful  tool,  allowing  the  analyst  to  "compose"  his  areas 
of  comparison  and  graphically  determine  associations  in  a 
multivariate  sense.  The  actual  data  is  never  altered  and 
there  is  no  delay  due  to  timely  resubmissions  of  programs. 
In  addition  the  grapMcal  displays  are  rapidly  available  and 
intuitively  appealing  ,  requiring  little  explanation  to 
those  decision  makers  with  less  background  in  classical  data 
analysis  techniques. 

The  right  panel  of  Figure  3.7  indicates  the  results 
of  removing  the  outlier  enlistees  with  prior  service.  The 
scale  of  the  boxplots  is  now  larger  so  the  mean,  median,  and 
spread  can  be  readily  ascertained.  Mote  that  the  mean 
length  of  service  has  been  only  slightly  modified  by  the 
removal  of  the  5  outliers ( determined  by  the  "ALL"  rcw  of 
the  two  tables  belcv  the  plcts, 3078-3073) ,  as  expected. 
Also  note  that  the  shape  of  the  boxes  before  and  after  the 
removal  of  the  outliers  is  the  same,  indicative  of  the 
resistance  cf  the  bcxplot.  The  right  panel  of  the  figure 
shows  that  performance  in  the  form  of  length  of  service 
tends  tc  increase  as  education  level  increases  from  the 
junior  high  level  tc  high  school  diploma  graduate  status. 
College  level  enlistees  and  higher  do  not  demonstrate  this 
trend.  Mote  that  enlistees  with  the  graduate  equivalency 
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certificate  demonstrate  poorer  performance  than  ncn-hich- 
scb.ool-diplcma  graduates  (NHSDG) .  Ihe  variance  of  the 
distribution  of  the  enlistees  with  the  GED  is  greater, 
perhaps  due  to  the  Banner  in  which  the  GEO  may  be  awarded: 
Persons  with  any  level  of  education  can  test  for  and  gain 
the  GEE,  and  numerous  tests  are  available  for  the  certifi¬ 
cate.  Hence  the  large  variarce  may  be  explained  by  the 
"variance11  of  the  means  of  awarding  the  certificate. 

In  the  next  figure,  (Figure  3.8),  the  non-prior- 
service  enlistees  are  again  presented  in  the  left  panel  as  a 
"comparative  other."  In  the  right  panel  of  the  figure,  all 
those  soldiers  who  had  an  uncoded  education  level  and  those 
with  EhD  degrees  have  been  deleted,  again  utilizing  the 
selection  feature  of  the  GltAFSTAT  program.  This  results  in 
a  "smccther"  line  through  the  means,  allowing  a  more  discer¬ 
nible  view  of  the  association  of  each  level  of  education  and 
length  of  service.  Note  from  the  table  that  only  2  persons 
were  in  these  deleted  categories,  hence  they  are  referred  to 
as  "cutlier"  education  levels  in  the  title  of  the  plctv 
These  points  will  reaain  deleted  throughout  the  remainder  of 
this  particular  analysis. 

In  Figure  3.S,  left  panel,  the  previously  discussed 
boxplot  with  "outlier"  education  levels  deleted,  ncn-pricr- 
service  enlistees  is  again  presented  for  further  comparison. 
In  the  right  panel,  utilizing  the  selection  capability,  all 
those  enlistees  with  any  years  of  college  have  b^en  deleted. 
How  the  increasing  trend  is  clearly  evident.  As  education 
increases,  length  cf  service  increases.  Again  these 
soldiers  with  the  graduate  equivalency  certificate  (GEO) 
demonstrate  a  lower  mean  perforacince  than  both  the  nen-high- 
school-graduates  and  the  high  school  diploma  graduates.  To 
isolate  this  trend  even  further,  GED  soldiers  are  deleted  in 
the  right  panel  of  Eigure  3.10  Now,  compared  to  the  left 
panel  of  this  display,  the  upward  trend  is  most  evident. 
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The  distribution  of  each  education  level  is  graphi¬ 
cally  described  in  the  boxplot.  For  example,  in  Figure  3.10, 
the  high-school-diplcma-gra duate  distribution  appears  to  be 
symmetric  as  indicated  by  the  position  of  the  mean  and 
median  relative  to  the  ends  cf  the  body  of  the  box.  The 
distribution  appears  to  have  a  "thicker'*  tail  in  the  lower 
length  of  service  side,  depicted  by  the  longer  whisker  on 
the  bottom  cf  the  boxplot.  Bapid  comparisons  of  the  distri¬ 
butions  cf  each  of  the  subcategories  can  be  done  through  the 
boxplot  analysis. 

Thus  the  use  cf  box  plots  has  indicated  that  level  of 
education  dees  have  ax  effect  on  performance  in  the  form  of 
length  of  service.  It  should  be  reiterated  that  this  phase 
of  the  analysis  is  exploratory  in  nature;  the  comparison  of 
means  could  be  strengthened  through  such  confirmatory  anal¬ 
ysis  as  a  one-way  ANC7A  or  in  a  non- parametric  test  such  as 
the  Krcskal-Rallis  test  of  eguality  of  means.  An  exaaple  of 
this  confirmatory  analysis  will  be  provided  in  a  later 
section  cf  this  thesis. 

2.  The  Multivariate  Capability  qj  the  Boxplot 

Military  occcpational  skill  of  the  enlistee  will  be 
examined  for  any  indication  of  an  association  with  perform¬ 
ance,  and  then  further  analyzed  with  regards  to  education 
level  within  the  occupational  skill.  Seven  military  occupa¬ 
tional  skills  (MOS)  were  considered  in  this  analysis. 
(Befer  tc  Table  I,  previous  chapter)  .  Again  a  ten  percent 
sample  of  the  data  was  utilized  for  analysis. 

Figure  3.11,  left  panel,  provides  the  first  lock  at 
this  subcategory  of  the  data.  Military  occupational  skill 
is  plotted  against  total  active  service.  Each  military 
occupational  skill  is  presented  on  the  x-axis,  beginning 
with  combat  arms  (1  IB,  11X,  13B) ,  combat  support  (31M,  64C), 
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and  ccufcat  service  support  (762,94b).  As  before,  the  distri¬ 
bution  of  each  military  occupational  skill  vith  respect  to 
total  service  is  presented  by  the  boxplot.  Again  note  the 
outliers  in  total  service;  these  same  5  soldiers  have  been 
deleted  in  the  right  panel  so  as  to  expand  the  scale  cf  the 
boxplcts  for  analysis.  Through  the  graphical  depiction  of 
these  distributions,  the  occupational  skill  most  successful 
in  terns  of  service  can  be  readily  determined.  Here,  skill 
76T,  supply  specialist,  demonstrates  the  greatest  success 
followed  by  11 B,  Infantryman. 

The  spread  cf  the  distribution  can  be  observed  in 
the  body  of  the  bcx;  note  that  the  deviation  is  also 
provided  in  the  table  below.  The  111  occupational  skill  is 
noticeably  vacant.  This  HOS  was  created  as  an  entry  level 
of  infantxyzan  in  FT80,  hence  this  data  set  contains  none  of 
this  general  skill.  Comparing  the  three  branches  of  skills 
described  above,  combat  arms  was  most  successful  in  perform¬ 
ance  as  determined  by  viewing  11B  and  13B  boxplots  collec¬ 
tively.  Combat  service  support  was  next  in  order  of  months 
successful  service  followed  by  combat  support. 

The  shape  of  each  of  the  distributions  is  readily 
apparent  in  the  boxplcts.  Each  military  occupational  skill 
appears  to  be  skewed  toward  the  lower  range  of  service  as 
indicated  by  the  postion  of  the  median  inside  the  bcdy  of 
each  boxplct.  The  variances  appear  relatively  constant 
except  fcr  skill  64C.  The  utility  of  the  boxplot  in  deter¬ 
mining  homogeneity  cf  variance,  for  example,  for  subsequent 
regressicn  analysis  or  for  ANOVA  assumptions  is  readily 
obvious. 

In  Figure  3. 12,  left  panel,  the  previous  figure  of 
non-priox-service-only  enlistees  has  been  reproduced  to 
allow  comparisons.  I r.  order  to  isolate  the  effects  of 
education  level,  all  high- school-diploma  graduates  (and  60 
enlistees  vith  at  least  some  college)  have  been  selected  cut 
of  the  subpcpulation,  using  the  selection  capability.  Thus 
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in  the  right  panel,  non-prior-service, high  school  diploma 
graduates  have  been  plotted  against  total  active  service. 
Immediately  note  that  a  higher  mean  service  is  observed  by 
these  tetter  educated  enlistees.  The  variance  of 
occupational  skills  has  been  reduced  indicating  that  this 
subpopulaticn  is  not  as  great  a  risk  vhen  considering  bow 
long  each  will  serve.  Those  soldiers  in  skill  TIE  demon¬ 
strate  the  highest  success  (mean  36.15  months).  Eased  on 
these  graphical  results  high  school  diploma  graduates 
perform  tetter  than  the  average  aon- prior-service  enlistee, 
and  more  specifically,  combat  arms  skills  rank  first 
followed  by  combat  support,  and  then  combat  service  support. 
This  may  seem  counterintuitive  as  technical  skills  required 
are  thought  to  be  higher  in  the  combat  service  support 
branches. 

Figure  3.13  repeats  the  same  sort  of  analysis,  this 
time  comparing  ncn-prior  service  enlistees  total  service  in 
the  left  panel  to  nan-  prior-service  enlistees  that  have  not 
received  a  diploma  (HHSG)  and  those  that  have  received  a 
graduate  equivalency  diploma  (GED).  Npte  that  "success"  has 
fallen  from  31.85  months  to  29.08  months.  Occupational 
skill  761  has  the  highest  mean  success,  and  combat  service 
support  now  leads  in  total  mean  service.  Also  note  that 
occupational  skill  31H  demonstrates  a  much  higher  variance 
when  only  these  non-high-  school  graduates  are  compared  to 
the  total  population.  This  could  perhaps  be  explained  by 
the  lack  of  any  real  standards  in  how  the  GED  is  awarded. 
This  certificate  can  be  awarded  at  any  level  of  education, 
provided  that  one  of  numerous  tests  has  been  passed.  Also 
the  ncn-high-schcol  graduate  with  ,say,  only  a  10th  grade 
level  of  education  could  account  for  extremely  poor  perform¬ 
ance  in  this  somewhat  technical  microwave  operator  skill. 
Cn  the  ether  hand,  those  enlistees  with  an  equivalency 
degree  or  11-plus  years  of  education  from  a  "technically 
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oriented"  school  system  could  account  for  the  high  perform¬ 
ance.  Here  research  could  he  performed  in  the  area  of 
"levels"  of  equivalency  status  and  in  a  demographic  analysis 
of  those  non  high  school  graduates. 

A  comparison  of  high  school  diploma  graduates  to 
non-high-school-diplcna  graduates  is  presented  in  Figure 
3.14  •  Those  soldiers  with  egui valency  certificates  (GED) 
have  teen  included  in  the  non  high  school  graduate  category. 
In  the  left  panel  of  this  figure,  the  high  school  graduate 
category  is  presented  for  comparison.  Again,  this  plot  is 
non  prior  service  only  graduates.  The  right  panel  is  non- 
prior-service, non-high-schocl-graduates  and  equivalency 
certificate  holders.  The  "tetter  educated"  enlistees  ,on 
the  average,  have  outperformed  those  with  less  than  a  school 
degree (as  shown  ty  the  mean  values  both  in  the  boxplots  and 
the  tabled  values  below  the  pi  it).  Skill  767,  supply 
specialist, in  the  right  panel,  indicates  a  tetter  perform¬ 
ance,  (mean  of  33.6)  than  the  same  occupational  skill  with 
a  high  school  diploma  (mean  32.7) .  The  variance  of  the  two 
distributions  is  roughly  the  same.  Again  personal  charac¬ 
teristics  and  the  technicality  of  the  skill  involved  alcng 
with  the  effects  of  mental  group  distributions,  enlistment 
tonus  differences  may  explain  this  anomaly. 

Since  a  difference  has  been  established  in  perform¬ 
ance  based  on  the  certificate  status  at  entry,  the  bcxplot 
can  be  utilized  to  go  even  into  further  detail.  In  Figure 
3. 15, non-high-school-graduates  only  are  compared  tc  high- 
school-diplcma  grad cates  and  in  Figure  3. 16  those  with 
equivalency  degrees  are  compared  to  the  diploma-carrying 
soldiers.  In  each  figure,  _ the  batch  remains  limited  to 
non-prior-service  enlistees,  varying  only  education  levels. 

Figure  3.15  demonstrates  that  the  non-high-schocl- 
graduate  performance  , depicted  in  the  right  panel,  was 
below  diploma  'graduate  performance.  This  is 
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graphically  by  comparing  the  relative  position  of  the  mean 
lines  and  in  the  tabled  value  below  each  plot.  Going 
further,  a  comparison  of  each  individual  occupational  shill 
in  the  left  panel  to  its  counterpart  in  the  right  panel 
indicates  that  the  varied  educational  level  produces  and 
entirely  different  distribution.  This  is  observed  through 
the  location  of  the  respective  means  and  medians,  the  size 
of  the  body  of  the  box,  and  the  length  of  thenvhiskers”  or 
tails  of  the  distribution. 

In  Figure  3.  16,  this  same  difference  in  the  distri¬ 
bution  of  military  occupational  skills  with  respect  to 
education  level  is  again  obvious.  In  every  case,  the  high 
school  diploma  graduates  outperform  the  soldiers  with  equiv¬ 
alency  status.  Again,  GED  holders  exhibit  a  larger  variance 
as  indicated  by  the  body  of  the  box,  indicating  a  higher 
risk  in  attrition. 

i 

The  entire  analysis  presented  on  the  effects  of 
education  level  within  military  occupational  skill  is  sumsa- 

i 

rized  in  Figure  3.17  ,  where  the  baseline  of  ncn-prior 
service  enlistees,  categorized  by  occupational  skill  versus 
length  of  service  is  displayed  simultaneously.  Education 
level  has  been  selected  in  each  plot.  Education  level, 
ailitary  occupational  skill,  and  length  of  service  have  been 
integrated  into  a  single j display.  Any  other  combination  of 
variables  such  as  marital  status,  age,  race,  could  be 
further  selected  to  provide  more  of  a  multivariate  display. 
The  I I A  techniques  combined  with  the  IBM  GBAFSTAT  package 
allows  for  any  combination  of  covariates  in  an  analysis, 
limited  only  by  the  imagination  of  the  analyst.  This 
display  allows  a  rapid  comparison  of  the  effects  of  educa¬ 
tion  level  on  performance  (in  the  fora  of  length  of 
service),  perhaps  providing  a  strong  argument  in  favor  of 
these  graphical  met beds  for  at  least  initial  decisions 
regarding  what  level  of  education  to  recruit.  Again, 
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confirmatory  analysis  is  necessary  for  a  more  refined 
analysis  of  relative  merits  of  alternative  educational 
policies. 

3.  Summary  cf  Boxplot  Analysis  o£  Remaining  Variables 

The  remaining  possible  explanatory  variables  and  the 
two  "general  performance  indicators"  were  analyzed  in  the 
same  manner  as  presented  above.  In  each  case  the  effects  of 
education  level  were  observed  in  a  multivariate  sense  by  the 
use  of  selection  combined  with  the  pairwise  comparison  of 
length  of  service  and  the  other  candidate  explanatory  vari¬ 
ables.  The  actual  bcx plots  are  found  in  Appendix  H.  Again, 
confirmatory  analysis  should  be  performed  to  verify  the 
statistical  significance  of  any  conclusions  draws  from  this 
exploratory  analysis.  A  summary  of  this  analysis  is 
presented  below: 

1.  Rental  Category  Versus  length  q£  Service:  {See 

Figures  H.1  through  H.9,  Appendix  H) 

•  As  mental  category  increases,  length  of  service 
increases.  Category  4c  outperform  Categories 
4b, 4a, 3b. 

•  High  school  diploma  graduates  outperform  the  ncn- 
high-school-graduates  and  eguivalency  certificate 
holders  (GEE)  in  all  categories. 

•  Nc  category  1  soldiers  were  observed  in  the  ncn- 
high-school-graduate  or  GED  categories. 

•  Non-higb-schcol-graduates  outperformed  the  eguiva¬ 
lency  certificate  (GEO)  holders  in  every  category. 

•  Variance  in  the  non-bigh-school  graduate  and  GED 
performance  is  higher  than  that  of  the  di plena 
graduates,  indicating  higher  risk  in  attrition. 
Eguivalency  certificate  (GED)  holders'  variance  was 
observed  to  be  higher  than  non-higb-schod- 
graduates,  again  indicating  a  higher  risk. 
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2 .  Marital  Status  and  Humber  of  Dependents  vs.  length  of 

Service:  {See  Figures  B.  10  -  H.  15r  Appendix  B) 

•  dost  enlistees  were  single  vith  no  dependents  at 
time  of  entry  (92%) 

•  Single  enlistees  with  1  dependent  had  highest  mean 
performance. 

•  In  general,  married  enlistees  outperformed  single 
enlistees. 

•  As  number  of  dependents  in  married  soldiers  at  time 
of  entry  increased,  sc  did  performance,  (note:  true 
up  to  3  dependents.) 

•  High-school-diploma  graduates  led  in  performance, 
followed  by  non-high-school-graduates,  then  G2D 
holders. 

3.  J<L§  Versus  Ifpath  Service:  (See  Figures  H.16 

through  H.24,  Appendix  B) 

•  As  age  increases  (at  time  of  entry)  from  17  to  19 
years,  tctal  service (performance)  increases, 
fallowed  by  a  leveling  off  in  the  19  to  24  year 
range.  Performance  decreases  as  age  increases  from 
24  to  29  years  old. 

•  High-school-diploma  graduates  outperformed  ether 
less  educated  entrants,  non-high-school  graduates 
followed,  then  equivalency  certificate  holders. 

4.  §si  Versus  Length  si  Service :  (See  Figures  H.25 

through  H.30,  Appendix  B) 

•  Hales  outperformed  females-. 

•  Diploma  graduates  (HSDG)  outperformed  all  ether 
education  levels;  Hcn-high-schooi  grads  outper¬ 
formed  the  GID  holders. 

•  Hon-higb-schcol-graduate  females  outperformed  male 
non-higb-schcol-graduates  and  female  high-schcol 
diploma  graduates.  However,  non-high-schocl- 
graduate  females  displayed  the  greatest  variance  in 
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length  of  service,  indicating  a  possible  higher 
risk  in  attrition. 

•  Hon-high-schcol-graduate  faaales  sere  followed  by 
GEE  sales  and  then  HESS  nales  in  variance  is  length 
of  service  (risk) . 

5.  Bace  Versos  length  si  Service:  (See  Figures  H.31 

through  B.36,  Appendix  H) 

•  In  general,  the  "other"  category  were  the  highest 
perforaers,  followed  by  blacks,  and  then  whites. 

•  Bigh- school -dip Iona  graduates  were  the  highest 
perforaers  is  terns  of  length  of  service  in  all 
categories. 

•  Hon-higb-schcol-graduates  outperforaed  the  graduate 
eguivalency  certificate  holders  (GEE). 

•  GEE  blacks  displayed  the  highest  variance  in  length 
of  service,  indicating  a  higher  risk  in  attriticn, 
followed  by  GEE  whites. 

6.  Beenlistaent  Code  Versus  Education  £ev§2:  (See 

figures  H.37  through  B.39,  Appendix  H) 

•  At  least  through  the  hi  ;  i  school  diplona  graduate 
level  of  education,  as  education  increased,  re en¬ 
listees  t  eligibility  increased. 

•  GEE  soldiers  receive*,  about  the  saae  reenlistaent 
codes  as  these  soldiers  wh*.  entered  the  aray  with  i 
years  of  high  school. 

•  A  vide  variance  in  reenlistaent  eligibility  was 
observed,  possibly  related  to  the  fact  that  reen¬ 
listaent  eligibility  is  aostly  up  to  the  local 
ccaaander's  discretion. 

7.  Beenlistaent  Cede  Versus  length  Service:  (See 

Figures  H.39  through  B.44,  Appendix  B) 

•  Approxiaately  30%  of  the  saaple  was  uncoded  at  the 
tiae  of  the  last  file  update,  indicating  that  the 

.bar  to  reenlistaent  aay  not  used  as  often  as  it 
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could  be  as  a  rehabilitative  tool  for  substandard 
soldiers. 

•  Generally,  as  length  of  service  increased,  sc  did 
the  nuater  cf  soldiers  eligible  to  reenlist. 

•  Approximately  50S  of  the  GED  holders  and  of  the 
non-high-schcol-graduates  indicated  a  reenlistment 
code  of  3, corresponding  to  only  3055  in  the  b.;..- 
school-diplcxa  graduates.  This  code  corresponds 
tc  a  Department  of  the  Army  initiated  bar  tc  reen¬ 
listment.  This  proportion  seems  a  bit  unreasonable 
and  more  research  seems  necessary  in  this  multivar¬ 
iate  combination. 

•  High-school-diploma  graduates  outperformed  all 
others,  followed  by  non-high-school-graduates  and 
the  graduate  equivalency  certificate  holders. 

•  Again,  the  GED  holders  demonstrated  the  largest 
variance  (risk)  in  length  of  service,  followed  by 
non-higb-schcol  graduates. 

QteZSSLgj  2f  1jS£S2S  Sducatjgfl  Level:  (See 

Figures  H.45  through  H.47,  Appendix  H) 

•  Generally,  as  education  level  increased  (at  time  of 
entry)  through  high  school  diploma,  so  did  the 
character  of  service  awarded. 

•  GEC  and  non-high-school-graduate  soldiers  received 
about  the  same  treatment  in  character  of  service 
awarded.  Again,  high-school-diploma  graduates 
received  the  largest  proportion  of  the  honorable 
discharges. 

•  College  graduates  outperformed  the  high-schocl- 
diploma  grads;  however  those  entrants  with  one  and 
two  years  of  college  were  outperformed  in  terns  of 
character  of  service  awarded  by  the  non- high  school 
diploma  graduates. 
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•  A  vide  variance  vas  noted  possibly  because  of  the 
cciaanders*  discretion  allowed  in  determining  what 
character,  cf  service  will  be  awarded  tc  an 
individual. 

9.  Character  o£  Service  Versus  Length  of  Service:  (See 

Figures  H.48  through  H.  54,  Appendix  B) 

•  As  length  of  service  increased  so  did  the  number  of 
honorable  discharges  awarded. 

•  Over  half  cf  the  cohort  received  an  honorable 
discharge.  A  large  proportion  of  early  "leavers" 
also  received  this  honorable  discharge, perhaps 
indicating  that  some  commanders  are  very  lenient  in 
their  determination  of  type  discharge  to  be 
awarded. 

•  High-school-diploaa  graduates  receiving  honorable 
discharges  performed  the  best  in  terms  of  length  of 
service,  again  followed  by  non-high-schocl  gradu¬ 
ates  and  the  GEO*s. 

•  A  wider  variance  in  length  of  service  vas  noted  for 
the  non- high-school  graduates  and  the  GED's  who 
received  honorable  discharges,  when  compared  to  the 
high-school-diploaa  graduates.  This  perhaps 
relates  again  to  the  discretion  that  is  exercised 
by  the  local  commanders  in  awarding  discharges.  It 
seems  that  length  of  service  may  not  be  considered 
as  an  indicator  of  "good"  service  by  a  number  of 
ccaaanders  in  the  field. 

£.  HABILE  COlFIBHilCET  AHAIISIS 

Conclusions  drawn  from  the  above  explanatory  data  anal¬ 
ysis  must  be  analyzed  in  a  formal  manner  to  determine  if 
differences,  say,  in  the  mean  performance  among  the  varying 
levels  of  education  at  entry  is  statistically  significant. 
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in  example  is  presented  below;  this  type  of  analysis  should 
be  performed  on  all  conclusions  reached  before  any  policy 
implementation. 

The  one-way- analysis  of  variance  provides  a  well- 
structured  approach  in  testing  the  equality  of  means  among  k 
sample  populations.  In  this  approach,  the  k  populations  are 
assumed 

1.  tc  be  i.i.d  normal  populations,  and 

2.  tc  have  egual  variances. 

is  has  teen  pointed  out,  some  of  the  boxplot  analyses 
above  has  indicated  that  the  egual  variance  assumption  is 
obviously  net  true;  contrarily,  this  difference  in  variance 
was  was  used  as  an  indicator  of  the  "risk'*  involved  in 
recruiting  that  entrant  with  his  particular  qualifications. 
In  those  cases,  nenpara metric  tests  are  available  for 
confirmatory  analysis,  an  example  of  that  type  will  also  be 
presented  below.  See  [Ref.  15;  pp.  492-503]  for  a 
discussion  on  1-way  akova. 

The  actual  calculations  of  the  aNOVa  table  were 
completed  using  an  an  program  contained  in  public  domain  of 
the  Haval  Postgraduate  School  computer  system  (library 
5,OA3660).  A  copy  of  the  program  is  at  Appendix  I.  Results 
of  the  analysis  are  summarized  in  Table  V  Thus  the  null 
hypothesis  can  be  rejected  at  the  .05  level  of  significance. 

In  instances  where  the  homogeneity  of  variance  and 

\ 

normality  of  population  assumptions  are  infeasible,  nonpar- 
ametric  tests  can  be  utilized  for  confirmation  of 
statistical  significance.  The  Kruskal-Wallis  test  was 
utilized  tc  test  the  following  hypothesis; 

BOimean  services  across  all  education  levels  are  egual. 

Bis  for  at  least  one  pair  of  the  population  represented 
by  the  levels  of  education,  the  means  are  different. 
See  [Hef.  16;  pp. 229-237]  for  a  discussion  of  the  K-W 
test. 


/ 
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TABU  V 

Ancva 

Table  for  Testing  Equality  of  Beans 

ZfANOVA  MATR 

ANOVA  WAS 

UPDATED 

1/3/79*  SEE  ANOVAHOW 

FOR 

CHANGES , 

ANOVA  TABLE 

SOURCE 

DF 

SS  MS 

F 

TREATMENT  4 

25933.42  6433. 

35 

21.67 

ERROR 

2993 

896813.33  299. 

14 

1 

j 

TOTAL 

3002 

922746.75 

1 

R-SQUARE 

=  0.028 

i 

OVERALL  MEAN  s 

31.93 

1 

TREATMENT 

EFFECTS 

“4.76  2.69  "1.29  ~ 

3.24" 

12.73  j 

1 

lie  actual  calculations  were  performed  utilizing  tie 
EHDP  statistical  software  program  P3S  on  tie  naval 
Postgraduate  Sciool  Coaputer  System.  See  £Bef.  17: 
pp«  442-443]  for  a  description  cf  tiis  package.  Besults  are 
summarized  in  Tails  11  • 


TABU  TZ 

Besults  of  I— 1  Test  of  Equality  of  Seans 


Variable 
Group 
Bo.  Bane 

1  SOPH 

2  JB 

3  GEO 

4  KESG 

5  BSOG 


1  10S 
Frequency 

10 

752 

196 

504 

1546 


Bank  Sum 

9064.0 

1017353.5 

250661.0 

730567.5 

2517723.0 


Kruskal-tialli?  Test  Statistic  »  75.13 
level  of  Significance  =  0.0000 

Using  Cii-Sguare  Distribution  witi  4  Degrees  of  Freedom 


Tins  the  null  hypothesis  can  be  rejected  at  the  .05 
level  of  significance.  Multiple  comparisons  may  be  calcu¬ 
lated  in  accordance  with  [Bef.  16:  p.231]  to  determine 

which  pairs  of  means  are  different  if  desired. 

I.  SOB MAE I  OF  EXPLOBATOBY  DA 'II  ANALYSIS  EFFOBTS 

1.  general 

Exploratory  Data  Analysis  techniques  have  been 
utilized  in  the  forms  of  draftsman's  displays  and  boxplots 
to  "preprocess"  a  large  data  set.  This  analysis  was 
presented  as  a  demonstration  of  the  power  of  EDA  techniques 
and  tc  provide  initial  insights  into  the  attrition  cf  U.S. 
Army  enlistees  prior  tc  further  analysis.  These  techniques 
have  provided  the  following: 

1.  Familiarity  with  data  set. 

2.  Seduction  in  the  dimensionality  of  the  data  set; 
numerous  variables  were  determined  as  having  no 
appreciable  effects  on  the  dependent  variable  under 
consideration. 

3.  Identification  of  erroneous  data. 

4.  Structure  of  the  data  set  and  variable  coding. 

5.  Information  on  multivariate  and  pairwise  associations 
among  the  variables.  This  information  will  be 
summarized  below. 

6.  An  intuitively  pleasing  form  of  analysis  tc  assist 
analysts  and  decisicn  makers  in  understanding  the 
problem  at  hand. 

.  A  means  of  allowing  the  analyst  to  "compose"  his 
method  of  attack  on  a  large  problem  in  an  interactive 
fashion. 


7 


2.  1£A  and  g.S.  Army  Enlisted  Attrition 

flore  specifically,  in  the  investigation  of  attrition 
of  O.S.  •  Ariy  PI79  eclistees,  the  following  information  has 
been  revealed: 

1.  The  variables  listed  in  Table  VII  below  were  observed 
tc  have  some  effect  on  a  soldiers  performance  which 


has  been  defined  as  his  total  length  of  service. 

2.  level  of  education  at  entry  has  an  effect  on  the 
performance  of  enlistees.  Education  level  seess  to 
interact  with  the  other  variables  listed  in  Table  VII 
above,  prodccing  different  levels  of  performance. 
Other  insights  provided  by  the  analysis  are 

•  High  school  diploma  graduates  demonstrated  better 
performance  than  did  non-high-school-diploma  gradu¬ 
ates  and  those  enlistees  who  had  obtained  a 
graduate  equivalency  certificate  prior  to  entry. 

•  Han-high  school  diploma  graduates  demonstrated 
better  performance  than  did  GED  holders. 

•  GEE  holders  demonstrated  a  larger  variance  in  total 

service  obtained,  indicating  a  higher  risk  in 
attrition  than  did  non- high  school  diploma 

graduates  and  high  school  diploma  graduates. 
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3.  Character  of  Service  and  Heenlistment  Code  are  two 
indicators  of  performance  that  may  also  be  affected 
by  education  level  at  entry;  however,  the  discretion 
exercised  by  local  commanders  in  awarding  these  may 
have  confounded  them  as  suitable  explanatory 
variables. 

4.  Confirmatory  analysis  needs  to  be  performed  after 
exploratory  data  analysis  prior  to  forming  final 
conclusion  for  policy  implementation.  An  example  was 
provided. 


The  EIA  techniques  are  not  to  he  used  in  lieu  of  more  clas¬ 
sical  statistical  analysis;  they  are  to  be  used  in 
conjunction  with  them.  Some  limitations  are 

1.  Acceptance  of  their  use  by  other  statisticians. 

2.  Package  utilized  in  this  thesis  is  in  the  experi¬ 
mental  stage;  others  may  not  be  readily  available  to 
the  analyst.  Since  this  package  is  experimental, 
certain  capabilities  are  still  being  developed.  for 
example,  in  the  the  ANCVA  presented,  the  means  used 
in  the  analysis  were  not  stored  in  a  global  sense  for 
further  analysis  and  had  to  be  entered  "by  hand"  into 
an  A NOT A  program. 

3.  Ccst  of  graphics  capabilities  for  computer  systems. 

4.  Storage  necessary  for  IDA  packages  is  currently  not 
available  for  most  personal  or  desk  top  computers. 

Thus  Exploratory  Data  Analysis  has  been  shown  to  be 
a  useful  technique  in  the  initial  analysis  of  data.  In  the 
next  chapters,  a  more  formal  analysis  will  be  presented. 


If.  SCE7IV0B  FOHCTION  ANALYSIS 


A.  EACKGBCDHD 

lie  Exploratory  Data  Analysis  techniques  presented 
(along  with  the  necessary  confirmatory  analysis)  have  shewn 
that  level  of  education,  sex,  race,  age,  mental  category, 
and  marital  status/n outer  of  dependents  are  candidate  expla¬ 
natory  variables  in  determining  the  total  length  of  service 
for  an  enlistee. 

A  survivor  function  approach  is  utilized  to  gain  farther 
insight  into  these  explanatory  and  their  relationship  to 
length  of  service. 

Suppose  that  the  length  of  service  of  an  enlistee  is  a 
random  variable  X.  The  cumulative  distribution  function  or 
c.d.f.,  then,  can  he  viewed  as  giving  the  probability  that 
an  enlistee  will  "die"  or  " fail”  or  leave  the  army  before  x 
units  of  time,  a  realization  of  the  random  variable  X,  have 
elapsed.  Then  the  quantity 

S(x)  =  1-F  (x)  =  P(X>x)  (egn  4.1) 

called  the  survivor  function,  provides  the  probability  that 
an  enlistee  "survives"  more  than  x  units  of  time.  The 
survivor  function,  for  discrete  data,  is  a  step  function, 
where  the  height  of  the  jump  between  any  two  values  of  x  is 
equal  to  P  (J=x)  .  The  survivor  function  is  estimated  by 
using  the  following  relative  frequency  definition  of 
probatility  r 

S  (x)  =  1-F  (x)  =  (number  of  observations  >  x)/n  (eqn  4.2) 
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where  n  is  the  rusher  in  the  sample  being  considered 
[Hef.  18:  pp.  92-93,  263-264]. 

B.  IPPLICATIOH 

The  survivor  function  is  therefore  a  logical  means  of 
analyzing  enlistee  behavior  with  regards  to  length  of 
service.  for  this  jcrtion  of  the  analysis,  attrition  will 
be  defined  as  failure  to  complete  the  first  term  of  service. 
The  FT79  COHOST  consists  of  three-year  obligated  enlistees 
(310)  and  fcur-year  obligated  enlistees  (4 10)  .  Each  subset 
will  be  analyzed  separately.  The  reenlistment  decision  will 
be  defined  as  completing  greater  than  one  term  of  service. 
Hence,  based  on  the  survival  function  model,  the  fcllcving 
equations  indicate  the  "life"  cycle  of  the  enlistee: 


TABLE  Till 

Enlistee  Life  Cycle  Hodels 


P  (enlistee  will  attrite)  *  P(X<36  mos.) 

*  1-P(li>36  nos.)  =»  1-S(x) 

P (enlistee  will  complete  1  term]  =  P(X=36  mos.) 

*  Ht.  of  jump  at  xa3o 

P  (enlistee  will  reenlist)  =  P  (X>36  mos.)  =  1-S  (x) 
H  JU1  liHstees 

P  (enlistee  will  attrite)  =  P(X<48  mos.) 

*  1-P(X>48  mos.)  «  1-S(x) 

P (enlistee  will  complete  1  term]  a  P(X=4  8  mos.) 

*  Ht.  of  jump  at  x=48 

P  (enlistee  will  reenlist)  =  P(X>4  8  mos.)  » 


1-S  (x) 


These  realization  times  for  the  random  variable  X  are 
chosen  just  to  demonstrate  the  survivor  function  method- 
clog;.  Considerable  research  has  been  done  in  defining  the 
exact  tiie  cutoff  for  the  definition  of  attrition,  since  the 
first  Mterm"  of  service  ma;  actually  be  thirty-four  months 
as  opposed  to  thirty-six  because  of  the  various  "early  out" 
programs  offered  by  different  commands.  This  analysis  does 
allow  for  at  least  this  concept  in  its  definition  of  attri¬ 
tion  as  strictly  less  than  thirty-six  and  forty-eight  aenths 
(i.e.  thirty-five  and  forty-seven  months)  length  of  service 
for  the  three  year  and  four  year  obligations  respectively. 
The  methodology  suffices  no  matter  where  the  attrition  defi¬ 
nition  is  made  in  the  length  of  service  parameter. 

Survivor  functions  for  each  of  the  previously  defined 
candidate  explanatory  variables  are  estimated  utilizing  the 
Cumulative  Distribution  input  screen  in  the  IBB  GBAFSTAT 
data  analysis  package.  (See  Appendix  J  for  a  depiction  of 
this  screen.)  These  functions  are  analyzed  for  the  above 
mentioned  statistics. 

The  survivor  function  of  the  entire  sample  across  all 
variables  is  presented  to  demonstrate  the  characteristics  of 
the  analysis  in  Figure  4.1  •  Using  this  type  of  analysis, 
statistics  for  the  nest  prevalent  enlistee  are  presented  in 
Figure  4.2  and  in  Figure  4.  3  for  3  year  enlistees  and  4  year 
enlistees  respectively.  The  "most  prevalent  enlistee"  was 
determined  by  observing  the  total  number  in  each  of  the 
seven  variables  being  considered.  note  that  the  four  year 
obligated  enlistees  demonstrated  higher  probability  of 
attrition  (0.33  to  0.24),  and  a  lover  probability  of  reen- 
listment  (0.12  to  0.37)  than  the  three  year  obligated 
enlistee,  ceteris  paribus.  This  may  indicate  that  the 
utilization  of  the  three  year  term  of  service  is  more  cost 
effective  than  the  four  year  term,  considering 
"assess,dress, train"  cost  described  earlier.  Results  for 
the  nest  prevalent  enlistee  are  summarized  in  Table  IX  . 
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SAMPLE  GRAPHICAL  CALCULATIONS 

P(LENGTH  OF  SERV1CE-36MONTHS) 

-HEIGHT  OF  JUMP  AT  LOS- 36  MONTHS 
0.559-0.320-0  239 

P{ LENGTH  OF  SERVICE<36MONTHS) 

-VERTICAL  DROP  IN  CURVE  FROM  0  TO  35  MONTHS 
1.0-0.559-0.441 

P(LENGTH  OF  SERV»CE>36MONTHS) 

-VERTICAL  DROP  IN  CURVE  FROM  37  TO  59  MONTHS 
0.321-0.000-0.321 


Figure  4.1  Survivor  Function  For  All  3  10  Enlistees 

The  effects  of  education  level  on  attrition  of  tbree- 
year-ctligated  enlistees  are  presented  in  Figure  4.4  . 
Those  ezlistees  with  a  graduate  equivalency  certificate 
indicate  a  higher  probability  of  attrition  (0.54)  than  both 
those  vith  tvo  years  of  high  school  and  those  with  ncn-bigh- 
schocl-diplcma  graduate  (NHSDG)  status,  having  three  to  four 
years  of  high  school.  These  findings  reinforce  the  earlier 
hoxplot  analysis  of  length  of  service.  The  trend  is  also 
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Figure  4.2  Survivor  F auction.  Host  Prevalent  3X0  Enlistee 

evident  in  the  probability  of  reenlistaent  with  j  the  bigh- 
school-diplcaa  graduates  having  the  highest,  followed  by  the 
HBSDG,  then  those  with  2  years  high  school  and  then  those 
enlistees  with  equivalency  certificates.  Again  the  GED 
enlistee  is  seen  tc  be  inferior  to  the  non  high  school 
diploaa  graduate  (HHSEG).  fiesults  of  this  survivor  function 
analysis  are  in  Table  X  in  the  following  section. 


Figure  4.3  Survivor  Function,  Host  Prevalent  410  Enlistee 


TABU  II 

fiesults  of  Survivor  Analysis  on 

Enlistee 

Host  Prevalent 

Tern 

P(attrite) 

P(fuli  tern) 

P(Beenlist)  Ave  LOS 

3 

4 

Tears 

Tears 

0.242 

0.334 

0.365  " 

0.545 

0.373  35.53 

0.121  40.94  , 

i  n  n  n  n  •  rt  n  ro  ro  ro  o  3  n  n  k  ro  ro  « 
junoraoM  "MNtns  unatmu  iwMwms  AimeeoM*  twwmts 

Figure  4.<l  Survivor  Functions  of  Education  Levels 


C.  BESOITS  OF  S UBVIVCB  FUNCTION  ANALXSIS 


Similar  analyses  were  performed  on  the  remaining  six 
candidate  explanatory  variables  for  both  the  three  year 
enlistees  and  for  the  four  year  enlistees.  Actual  survivor 
functions  are  found  in  Appendix  K  and  L  respectively. 
Tabular  summaries  are  provided  in  Table  X  and  XI  .  Based  on 
there  summary  tables,  the  lowest  risk  attributes  in  terms  of 
lcw'Sv.  i-robability  of  attrition  are  provided  in  Table  XII  . 
Those  attributes  that  result  in  the  highest  probability  of 
reenlistment  are  shown  in  Table  XIV  and  Table  XV  . 

The  usefulness  and  simplicity  of  this  approach  is 
appealing  for  11  quick  and  dirty"  analyses  in  determining 
short  term  policy  decisions  concerning  the  char acteristics 
desired  for  prospective  soldiers. 

Modeling  of  the  survivor  curves  can  provide  further 
prediction  capabilities  and  significance  of  each  of  the 
covariates. However,  the  survivor  functions  presented  all 
demonstrate  the  large  "jump"  at  thirty-six  and  forty-eight 
months  for  the  three  year  and  four  year  enlis¬ 
tees, respectively.  This  large  "discontinuity"  causes 
modelling  of  the  survivor  function  to  be  somewhat  difficult. 
Modelling  of  the  survivor  is  discussed  in  great  detail  in 
£Bef.  19]  and  [Hef.  20].  The  noted  similarity  in  the 
survivor  curves  for  all  the  variables  under  investigation 
suggest  that  the  use  cf  a  Cox  proportional-hazards  mcdel  nay 
be  appropriate;  however  more  research  is  needed  in  the 
modeling  of  the  discrete  jump  in  the  survivor  curve.  See 
£Bef.  20].  This  analysis  has  been  presented  as  an  initial 
methodology  to  demonstrate  its  usefulness  ;  no  modelling 
will  be  performed. 
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TAB1E  Z 

Be suits 

of 

Survivor 

Function  Analysis  for 

3IO 

Variable 

P (ICS<36) 

P  (LOSs 36) 

P  (LOS>36) 

Ave.  LCS 

All  variables 
Education  lvl 

0.441 

0.239 

0.321 

30.87 

1  jr  High 

1  VIS  HS 
GEfi 

Sch 

3.778 

0.111 

0.111 

20.11 

3.503 

0.201 

0.296 

28.72 

3.542 

0.  184 

0.274 

27.09 

HHSDG 

3.461 

0.217 

0.322 

30.69 

HSDG 

3.366 

0.283 

0.351 

33.23 

College 

0.400 

0.400 

0.200 

32.80 

Sex 

Bale 

3.431 

0.246 

0.322 

31.17 

Feiale 

Bace 

ihite 

0.515 

0.175 

0.310 

28.39 

0.467 

0.258 

0.275 

29.21 

Elack 

ether 

flental  Cat 

0.405 

5.374 

3.357 

0.204 

0.238 

0.214 

0.391 

0.388 

0.429 

33.18 

34.02 

33.57 

Cat! 

Cat2 

3.401 

0.294 

0.305 

31.85 

Cat3a 

0.356 

0.320 

0.324 

32.40 

Cat3b 

0.457 

0.225 

G.318 

30.42 

Cat  4a 

0.449 

0.230 

0.321 

2  S  .86 

Cat4b 

0.468 

0.222 

0.310 

30.80 

Cat  4c 

0.443 

0.193 

0.364 

32.57 

Bar.stat/No. 

Deps 

i 

3.441 

3.448 

0.252 

0.083 

0.307 

0.469 

30.62 

32.92 

Barrie d/1 
Barrie  5/2 

1 

1.563 

0.000 

0.437 

30.19 

1 

1.381 

0.095 

0.524 

36.76 

Barrie d/3 

1 

3.333 

0.083 

0.583 

42.00 

Silitary  Skill 

0.225 

0.314 

30.28 

1 1B 

0.461 

13B 

( 

1.464 

0.195 

0.341 

30.73 

31H 

£ 

1.408 

0.241 

0.351 

31.07 

64C 

C 

1.419 

0.302 

0.279 

31.18 

761 

c 

.402 

0.217 

0.381 

32.82 

SOB 

l9l7 

c 

1.428 

C.260 

0.312 

30.85 

c 

1.491 

0.215 

0.294 

29.65 

18 

c 

.432 

0.255 

0.313 

31.04 

19 

{ 

.396 

0.266 

0.338 

32.09 

c 

.482 

0.232 

0.286 

28.92 

£ 

.405 

0.250 

0.345 

31.61 

22 

G 

.495 

0.206 

0.299 

28.80 

23 

£ 

.446 

0.  149 

0.405 

33.15 

24 

G 

.450 

0.225 

0.325 

32.18 

25 

£ 

.470 

0.  118 

0.412 

32.77 

26 

£ 

.556 

0.074 

0.370 

25.04 

27 

C 

.450 

0.150 

0.400 

30.80 

28 

C 

.364 

0.182 

0.454 

31.10 

29 

C 

.571 

0.  143 

0.286 

27.00 

30 

C 

.286 

0.000 

0.714 

53.14 

>  30 

0 

.524 

0.238 

0.238 

25.86 

Sesults  of 

TABLE  XI 

Survivor  Function  Analysis  for 

4IO 

Variable 

P  (ICS<48) 

P  (L0S=48J 

P  {LOS >4 8) 

Ave.  LCS 

All  variables 

0.441 

0.239 

0.321 

30.87 

I  Education  lvl  1 

1  vr  High  Sch 

— 

*• 

— 

— 

2  vrs  BS 

GED 

0.670 

0.110 

0.230 

v  31.78 

0.666 

0.  167 

0.167 

29.67 

NHSEG 

0.500 

0.125 

0.375 

35.88 

BSDG 

0.367 

0.409 

0.2  23 

39.18 

College 

0.600 

0.240 

0.160 

28.96 

sex 

1  Hale 

0.391 

0.389 

0.220 

38.29 

I  I e Bale 

0.667 

0.000 

0.333 

37.00 

i  Bace  1 

Shite 

0.387 

•  0.389 

0.220 

37.74 

Black 

0.424 

0.288 

0.288 

39.36 

ether 

0.307 

0.500 

0.  193 

39.42 

1  Mental  Cat  i 

Cat  1 

0.333 

0.429 

0.238 

41.14 

Cat2 

0.376 

0.36  8 

0.256 

38.63 

Cat3a 

0.324 

0.485 

0.191 

3S.78 

Catlb 

0.427 

0.427 

0.146 

37.29 

Cat  4a 

0.394 

0.346 

0.260 

38.10 

Cat4b 

0.465 

0.296 

0.239 

36.66 

Cat4c 

«... 

—  — 

I  Har.Stat/No.Deps 

SinqleyO  r  0.393 

0.402 

0.205 

37.88 

flarrg.ed/0 

0.550 

0.150 

0.300 

38.85 

Barrie d/1 

—— 

— — 

— — 

Harried/2 

— 

— 

—  — 

— —  | 

I  Military  Skill  ] 

11B 

C.359 

0.412 

0.229 

39.43 

33b 

0.434 

0.352 

0.214 

36.73 

31H 

mmmm 

— • 

€4C 

— 

—  — 

761 

— — 

— — 

—  — 

— - 

S4B 

Age 

17 

" 

—  — 

0.418 

0.436 

0.145 

36.86 

18 

0.349 

0.429 

0.222 

39.56 

19 

0.353 

0.431 

0.216 

39.12 

20 

0.797 

0.018 

0.185 

28.92 

21 

0.455 

0.394 

0.151 

36.72 

22 

0.476 

0.238 

0.286 

38.19 

23 

0.662 

0.040 

0.298 

33.15 

24 

0.675 

0.050 

0.275 

32.18 

25 

0.647 

0.000 

0.353 

32.77 

ii 

0.500 

0.125 

0.375 

33.50 

—  — 

— 

» —  I 

29 

— 

—  - 

— - 

30 

— — 

— 

—  — 

— 

>  30 
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TABLE  III 


Attribute  Values  Providing  Lowest  Bisk  of 

Attrition,  3X0 

Variable  Lowest  E  (attrition)  Next  lowest  P(  attrition) 

Education 

level 

HSI6 

College 

Sex 

Bale 

Feaale 

Bace 

Otter 

Black 

Bental  Cat. 

3A 

1 

Barit al  Stat/ 
He.  of  Deps 

B/2 

S/0 

Age 

28 

19 

BCS 

761 

3tB 

TABLE  EXIT 

Attribute  Values  Providing  Lowest  Bisk  of 
Attrition, 4X0 

Variable  Lowest  P (attrition)  Next  lowest  P( attrition) 

Education 
— Level 

HSIG 

HHSDG 

Sex 

Bale 

Feaale 

iMCe 

Otter 

White 

Bental  Cat. 

3A 

1 

flarital  Stat/ 
Ho.  of  Deps 

S/0 

B/0 

Age 

18 

19 

BCS 

1 1B 

31B 

8 


Attribute 

Variable 

TABLE  XIV 

Values  Providing  Highest  Beenlistoent,3  70 

Lowest  P  (reenlist)  Next  lowest  P  (reenlist) 

Education 

Level 

HSEG 

HHS06 

Sex 

Hale  / 

Fe  Bale 

Hace 

Black 

Other 

Hental  Cat. 

1 

4C 

Harital  Stat/ 

He.  of  Ceps  H/2 

a/2 

Age 

30 

28 

HCS 

767 

| 

31H 

i 


Attribute 

Variable 

i  TABLE  XV 

i 

Values  Providing  Highest  Beeniistaent,470 

lowest  P(reenlist)  Next  lowest  P  (reenlist) 

Education 

Level 

HHSLG 

2  yrs  HS 

Sex 

Feaale 

Hale 

Bace 

Black 

White 

Hental  Cat. 

4A 

2 

Harital  Stat/ 

Ho.  of  Deps  fl/0 

S/0 

Age 

26 

25 

HCS 

1 1E 

13B 
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1.  GEHEBAI 


7.  BESPITS  ASP  CONCLUSIONS 


Ail  intuitively  pleasing , si »ple  methodology  vas  presented 
for  the  study  c£  performance  in  the  form  of  length  of 
service  of  0.S  Army  enlistees.  Some  Exploratory  Data 
Analysis  techniques  vere  demonstrated  through  the  use  of  the 
IBM  GBAFSTAI  data  analysis  package.  The  interactive  capa¬ 
bilities  of  the  package  and  the  APL  language  were  exploited 
to  prcvide  a  means  of  rapidly  manipulating  and  observing  the 
selected  data.  He  tools  proposed,  the  draftsman's 
displays,  hoxplots  and  survivor  functions,  vere  used  on 
actual  cohort  data  frcm  the  Defense  Manpower  Data  Center. 

Several  possible  explanatory  variables  and  their  associ¬ 
ation  with  performance  vere  presented  based  on  the 
Exploratory  Data  Analysis.  Confirmatory  analysis  vas 
performed  to  suppcrt  the  Exploratory  Data  Analysis. 
Probabilities  of  enlistee  attrition  and  reenlistment  vere 
provided  using  a  survivor  function  analysis  for  each  of  the 
candidate  explanatory  variables.  Attributes  that  presented 
the  highest  risk  of  attrition  and  the  highest  probability 
vere  presented. 

B.  SDHHABI 

The  increasing  ccst  of  "assessing, dressing  and  training" 
today's  Army  enlistee  coupled  vith  the  diminishing  supply  of 
17-21  year  old  prospective  enlistees  have  prompted  research 
effort  tcvard  gaining  insight  into  those  personal  attributes 
that  produce  the  most  successful  soldier  in  terms  of  first 
term  completion.  The  basis  for  understanding  the  relation¬ 
ships  of  these  personal  attributes  and  for  using  this 


understanding  in  recruiting  policy  lies  in  the  ability  to 
rapidly  analyze  the  available  data  on  current  enlistees  and 
to  present  the  analysis  in  a  form  that  is  understandable  and 
useful  for  the  decision  maker. 

This  thesis  has  presented  a  broadly  applicable  and 
simple  methodology,  using  Exploratory  Data  Analysis  through 
the  interactive  capability  of  the  IBS  GHAPSTAT  package  and 
the  API  language,  for  defining  the  area  of  analysis,  identic 
fying  errors  in  the  data,  reducing  the  dimensionality  of  the 
problem,  and  determining  relevant  association  of  personal 
characteristics  of  enlistees  to  performance.  The  bonds 
between  Exploratory  Data  Analysis  and  more  classical  statis¬ 
tical  analysis  were  demonstrated.  Use  of  survivor  function 
analysis  provided  statistics  on  chosen  explanatory  vari¬ 
ables,  indicating  the  importance  of  these  characteristics. 

The  further  application  of  the  methods  in  this  thesis 
and  of  Exploratory  Data  Analysis  in  general  should  increase 
the  practitioner *s  ability  to  make  sound  decisions  regarding 
future  manpower  planning  issues.  With  the  increased  avail¬ 
ability  of  graphics-capable  personal  computers.  Exploratory 
Data  Analysis  is  relevant  at  all  levels  of  decision  making. 

C.  BZCGBBZ1DED  FUBTBIB  BBS EABCH 

The  following  items  deem  further  research: 

1.  A  comparison  of  Exploratory  Analysis  technigues  in 
this  theses  to  other  data  analysis  packages  such  as 
those  available  in  BBEP  and  SAS  (see  [Bef.  17]  and 
[Bef.  21]  )  vculd  be  useful  in  determining  advantages 
and  disadvantages  of  the  different  approaches  in 
variable  selection  and  error  identification.  In 
particular,  the  Cox  proportional  hazards  model  in  the 
BBDP  program  p21  £Hef.  17:  pp.  576-594]  uses  a  step¬ 
wise  approach  to  identify  important  explanatory 
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variables  and  estimates  the  survivor  as  well  as  the 
hazard  function  for  further  analysis.  Noce  however 
that  models  such  as  this  Cor  proportional  hazards 
model  for  estimating  the  effect  of  concomitants  on 
survival  curves  are  not  applicable  because  of  jumps 
is  the  survival  functions  at  known  times.  Here 
research  is  seeded  to  determine  how  to  apply  the 
model  to  such  a  function  with  this  large  discrete 
jump. 

2.  Ihe  Graduate  Equivalency  Degree  programs  offered 
throughout  the  Osited  States  need  to  be  analyzed  in 
detail  for  standards  used  in  awarding  the  certifi¬ 
cates.  The  wide  variance  and  the  poorer  performance 
of  the  GED  holders  indicate  that  non-high-schocl- 
diplcma  graduates  should  be  treated  separately  in  any 
analysis,  contrary  to  the  popular  grouping  of  the  two 
categories.  Perhaps  different  GED  levels  would 
provide  insight  into  future  performance  at  least  as 
well  as  the  different  levels  of  high  school  status 
have. 

3.  Ihe  trends  and  probabilities  have  been  presented  as  a 
methodology  on  only  a  10  percent  sample  of  the  data: 
comparisons  of  these  outcomes  to  other  data  sets 
would  be  useful  in  the  determination  of  prediction 
possibilities. 

4.  flodeling  of  the  survivor  curves  would  provide  a 
detailed  account  of  the  actual  contribution  of  each 
explanatory  variables  using  multivariate  regression 
techniques.  Again  further  research  is  needed  in  the 
applications  of  modeling  techniques  to  survivor 
curves  with  the  noticeable  jumps  at  known  times. 
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EXPLOB ATOEY  DATA  AHA1YSIS  TECHNIQUES 


Exploratory  Data  Analysis  techniques  are  usually  first 
attributed  to  Jchn  i.  Tukey  in  his  book  by  that  title 
£Bef.  22].  Exploratory  Data  Analysis  for  the  purposes  of 
this  thesis  will  be  defined  as  "the  activity  of  exaaining 
data,  both  graphically  and  through  numerical  summaries,  for 
the  purpose  of  revealing  properties  of  the  data  itself,  and 
with  luck,  of  the  processes  giving  rise  to  that  data." 
[Bef.  23:  p.  2].  Thus  EDA  techniques  can  be  thought  of  as 
"informal  "  techniques  to  examine  the  data  prior  to 
"formal",  more  classical  analysis  techniques,  in  order  to 
prevent  needless  calculations  irrelevant  to  the  investiga¬ 
tion  at  hand.  Quite  often  more  can  be  learned  about  the 
data  in  this  initial,  informal  look  at  the  data.  As 
Chambers  et.  al.  points  cut,  graphical  EDA  methods  are 
perhaps  most  effective  in  the  initial  glance  at  the  data  to 
limit  the  scope  cf  the  investigation  to  only  those  variables 
that  are  pertinent  [Bef.  12].  These  graphical  methods  allow 
the  investigator  to  rapidly  synthesize  information,  in  a 
more  efficient  and  intuitive  manner  perhaps  than  through 
methods  available  in  commercial  statistical  packages  that 
produce  tabular  data. 

Cne  particular  lethod  of  multivariate  analysis  is  the 
multidi sessional  array  of  scatter  plots  called  a  "general¬ 
ized  draftsman’s  display"of  the  data  [Befi  12:  pp.  136,145]. 
An  exaiple  display  is  seen  in  Figure  A. 1  This  figure  demon¬ 
strates  how  the  pairwise  scatter  plots  are  arranged  so  that 
"any  adjacent  pair  of  plots  have  an  taxis  in  ccamcr." 
(Bef.  12:  p.  145].  All  variables  of  interest,  then,  for 
the  entire  data  set  can  be  displayed  as  tne  first  phase  of 
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the  investigation.  Then  one  can  rapidly  and  effectively 
determine  if  trends  exist  and  for  which  specific  pairwise 
association  of  variables. 

Captain  Halcolm  Johnson,  a  student  of  Operations 
Hesearch  at  the  Naval  Postgraduate  School,  has  developed  an 
APL  progiaa  called  "drafts nan"  that  is  imbedded  is  IBS's 
GBAFSTAT  package  on  the  school's  computer  system  (See 
[Bef.  24:  pp.  13-17].  for  information  on  use  of  GBAFSTAT) 
that  organizes  any  data  set  into  a  draftsman' s  display. 
This  program  also  allows  for  transformations  of  the  data  and 
for  jitterii  7  of  the  data.  His  efforts  have  been  published 
as  a  Baster's  thesis  that  includes  a  tutorial  for  use  of  the 
"draftsman"  program.  [Bef.  13].  This  program  will  be 
utilized  in  the  initial  phase  of  the  data  analysis  efforts 
of  this  thesis. 

Cf  course,  the  draftsman's  display  is  only  the  first 
step  cf  the  analysis.  If  trends  are  evident,  then  further 
analysis  should  be  performed  utilizing  more  formal  confirma¬ 
tory  analysis  to  verify  any  graphically-determined  associa¬ 
tions  among  the  variables. 

The  use  of  boxplcts  is  another  EDA  technique  that  is 
very  useful  in  "taking  an  initial  look"  at  the  data.  The 
boxplct  is  a  "simple  method  of  summarization". 

The  upper  and  lower  guartiles  are  depicted  by  the  "body" 
cf  the  box,  the  median  is  portrayed  by  a  line,  circle  or 
other  distinguishing  mark  as  is  the  mean.  Upper  and  lover 
adjacents  are  depicted  at  the  end  of  lines  extending  from 
the  body  of  the  box.  These  terms  are  defined  as  the 
"largest  observation  that  is  less  than  or  equal  to  the  upper 
guartile  plus  1.5  times  the  interquartile  range,  and 
smallest  observation  that  is  greater  than  or  equal  to  the 
lover  guartile  minus  1.5  times  the  interquartile  range," 
respectively.  Values  that  fall  outside  the  range  cf  adja¬ 
cent  values  are  called  outside  values.  These  are  plotted  as 
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individual  joints.  See  Figure  A. 2  for  a  depiction  of  the 
components  of  the  boiplot. 

The  hoi plot  provides  a  rapid  "impression"  of  the  distri¬ 
bution  of  the  data.  Ihe  median,  mean,  and  spread  are  all 
obvious.  Ihe  length  of  the  lines  to  the  adjacent  values 
demonstrate  the  "stretch"  of  the  tails  of  the  distribution. 
Ihe  individual  points  for  the  outside  values  allows  the  user 
of  the  plot  to  consider  "outliers"  although  not  every 
outside  value  is  an  outlier. 

Ihe  figure  also  allows  for  some  determination  of  the 
symmetry  of  the  distribution  of  the  data,  simply  by  viewing 
the  symmetry  of  the  body  of  the  box  about  the  median  line  or 
dot. 

Ihese  plots  are  useful  when  it  is  not  feasible  or  neces¬ 
sary  to  capture  all  the  details  of  a  distribution,  or  when 
many  distributions  need  be  compared.  The  width  of  the  box 
has  no  significance. 

In  excellent  discussion  of  these  and  other  EDA  tech¬ 
niques  is  found  in  £Bef.  12],  from  which  this  description  of 
boxplots  was  taken. 
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SAMPLE  BOX  PLOT 

PLOT  OF  50  POINTS 


•  _  iPPER  QUARTILE  •  1.5*  INTERQUARTILE  DISTANCE 

o 

_  LARGEST  VALUE  i  LPPER  QUARTILE  *  INTERQUARTILE  DISTANCE 

JL  _  UPPER  QUARTILE 

_ KCDIAN 

_  LOWER  QUARTILE  ‘  * 

SMALLEST  VALUE  »  LOWER  QUARTILE  -  INTERQUARTILE  DISTANCE 
LOWER  QUARTILE  -  l.S* INTERQUARTILE  DISTANCE 

;  INTERQUARTILE  DISTANCE  *  LPPER  QUARTILE  •  LOWER  QUARTILE 


O 

t 


r 

i 
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•  Zxaapla  bos  plot  for  fifty  data  polata  free  a  rafaaaratlva 
alaulatloa.  Tha  lntarquartlla  dlataaea  equals  tbu  estimated 
■Wat  qwartlla  alaua  tha  aatlaatad  lower  quart Ua.  Tba  light 
elrclaa  ara  data  points  which  fall  batwaaa  tha  Urgeat  value 
laaa  than  or  equal  to  tha  oppar  quart 11a  plus  tha  iater- 
quartlla  dlataaea  aad  tba  uppar  quartlla  plus  l.S  tlaas  tba 
latarquartlla  dlataaea.  Tha  dark  elrelaa  ara  data  with  values 
•have  this  latter  point .  Similarly  for  tha  lower  part  of 
the  ho*  plat. 


figuxe  A. 2  Exaaple  Boxplot 
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43= 

1 

3.2 

9933 

70.2 

423 

1.7 

9341 

71.2 

• 

42T 

W 

e«4 

72.: 

•  • 

422 

42 

1314 

72.2 

43v 

51 

5366 

72.7 

72.7 

433 

1 

3.2 

5257 

443 

.  1 

2.2 

5  364 

13.7 

44C 

441 

344 

* 

?:? 

n:i 

•  •  •• 

i»l 

i 

?•* 

9524 

5539 

7«:i 

4  !*♦ 

5 

4  •  ? 

5»*3 

73.2 

47N 

47 

3.5 

15710 

7c.i 

• 

47T 

4 

13214 

76.  r 

■  7U 

31 

1.2 

10743 

i«. : 

4  7V 

44 

1.3 

1335  7 

73.4 

47* 

a 

l.i- 

.30  *4 

79.4 

Iff 

3.3 

10124 

79.7 

453 

15 

1.1 

15140 

73.4 

453 

14 

:.i 

13136 

73.5 

43* 

J 

3.1 

1C  164 

31.1 

41G 

53 

;•  4 

-C21  4 

:i.l 

531 

12 

:.i 

13226 

*:.* 

6d«l 

n 

3.3 

K2««* 

3C.7 

44“ 

:.i 

13271 

31.9 

7 1C 

i 

’•i 

17172 

»C.= 

713 

■* 

13274 

713 

:.i 

1*27  2 

iC.« 

IlL 

*4 

J  •  5 

13347 

*1.4 

• 

71*4 

u 

:•! 

15237 

51.3 

71*1 

5 

3.2 

11353 

4  1.6 

. .  .  . . .  . 

71° 

n 

i.3 

;C2'4 

72£ 

Si 

12444 

?2j 

t:: 

34 

1  345  J 

«!.? 

13 

••1 

.34*1 

32.6 

I2S 

1 

10441 

92.4 

742 

2 

134?; 

?2.6 

743 

4 

1*49? 

*2.4 

74* 

1  =  455 

*2.4 

753 

75 

3.4 

15573 

-3.5 

• 

752 

11 

:*2 

i359e 

*2.4 

7*3 

»is 

i: 

:.7 

12641 

M.l 

# 

2) 

5  •  £ 

13710 

*4.3 

92 


11  m  n  vv*.";'*"*-1  "• l.1".1.1.1.  1  T 


TA3L*  1C.  1 

•33  CISm9UTI3N  FCR  »R*>Y  KPS  KCC93S IONS. 


•an- 

ero. 

Cl*. 

I2U». 

1  1  2  2  2  2  4 

cm: 

:?nt 

eatc. 

73F 

9 

3.2 

1C  714 

*  +  .Z 

73  J 

i 

1.C 

10713 

r9.1 

763 

1 3 1 

l.l 

33936 

33.4 

• 

7  S3 

1 

3.0 

1363  7 

33.3 

76J 

12 

:.i 

10  =  77 

33. £ 

76L 

l 

lO.i  71 

•i.S 

763 

43 

7.3 

10311 

65.3 

7SU 

1 

7.2 

10=12 

■>3.c 

7SV 

JS 

3.7 

33363 

36.9 

• 

764 

11 

3.7 

113  29 

’7.3 

• 

764 

2 

o.c 

11361 

87.2 

167 

251 

5.c 

11244 

35.2 
'  85.2 

•  « 

eie 

3 

21249 

•1C 

2 

3.3 

21331 

89.2 

32C 

133 

3.3 

11460 

90. Z 

« 

32= 

1 

2«C 

11461 

92.2 

33F 

2 

a.i 

11462 

92.2 

849 

1 

3.3 

11969 

30.2 

=  4C 

1 

3.3 

11463 

•c.z 

■H 

1 

J.O 

11966 

3C.5 

S  1 A 

l 

11467 

32.2 

919 

»>T 

2.6 

11734 

52.3 

9 1C 

ii 

:•  i 

:!•* 

313 

9 

1.2 

1 19*5 

52.= 

31-3 

>9 

'3.2 

!J  -33 

5  3*  1 

i 

11.-34 

53.1 

UH 

9 

3.3 

11=  23 

53.2 

5  it. 
31? 

i 

16 

3.2 

3.1 

1  1629 
21=33 

’3.2 

33.2 

?ir 

21336 

13.2 

321 

1 

3.: 

21!  17 

9  3.2 

3  33 

1 

3.0 

11=36 

*3.2 

539 

11 

7.; 

1  1665 

52.  * 

33H 

3.2 

113  72 

93.  4 

3  34 

• 

3.2 

1  1  =  73 

53.3 

549 

3? 

3.1 

12273 

56.6 

•  •  • 

1 

3.3 

12274 

56.6 

33* 

12 

3.1 

12236 

1«.7 

Bi.,a 

237 

3.3 

22673 

55.  / 

■ti-9 

26 

2.2 

I2S39 

102.1 

17C.2 

363 

2 

3.  3 

2271’: 

1 

3*3 

12722 

172. * 

554 

1 

.2722 

17  :.2 

334 

2 

3.: 

.2703 

130«  J 

A£iI2JU  c 

?0HTB1I  PBOGHAHS  10  BEAD  DATA 


//DTHUHAS  JCc  11472,0.>9E),'  a.TnJPAs*  ,C*.Aii«o 
//■•MAIN  0*G«XPGVHl.i*72f 
//  EXEC  PGA IaCG 
//FUHI .SYS  IN  JO  « 

INTEGER**  SSAM,iSA,<Z,SiANj, 

4  UuSYC  iJGSrto  tu  oSCL  iuc^Y  2  ,CwaHS,  JCioS 


UuSYC , JCirtLi  .uuiCC  tuoY  j|i 
lNTfcucA**  oLArNK.Nlue 
0«TA  ELAM/'  •/,!«  INEZ* 


OC  300  1-I.J077B, 10 


■V'/ 


3  SAMI 
SSAN2 
S  SAN  3 
UCSYC 
OCSrtO 
3CSOC 

8 OSH  I 
QCSOS 


•  olANX 
■  ttG«‘«A 

•  Mile 

-  -9 
a  -9 
a  -9 

-  -9 

•  -9 
a  -9. 


RE  AO 
00  SY 


300 

100 

200 


( 1 1 10J I  SSAHl  1 3  S  AN  3  illANliu QSYu  iCJSHC, JGSwC , 
S.OGSHi.JOSCS 

hR1TE(2.20J>  SS  an  1«  SSA.NEuSANi,  OGSYCmUSHUiOuSCGi 
•  OOSYS.OCirtS.GGSCS 

c9nt.ncJ 

FCRHATI3A3.1e9A,I2.J2,12>97A.12.12.i2>  - 

FOfti AT (3aa«1a ,  1 2  • 1A >1 2 ■ IX 1 12 1 1  Ail  2  •  AA 1 12  • lAt 121 
cNO 


//6C»f T01F001  CC  0alT«J33OV,»OiiaScHaN302<>2.0UPaiOl.0>> 
//  JCo-(RECPH«P6il.RECL-32oi»i.xi»2e*12  71iI  , 

//  gSRAHE«PSi.SW72.CPT79 

y/50.FT02P001  00  J.«1T- J2aO.  V0l.aSE2-HVS00a *>  1  SP« t  SH*  1 1 
//  SPACca(CYL,(4,9ll.0Caa(AECF  ‘  '  “ 

0SRAHE«S14 12.4790 


// 

// 


ECFHarbtLX ECL- 27, olxS12c«1 4092  1 1 
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vAh 


S  TnAl  L 
■.FAS'  .a 


C  This  FRGLRAH  STRIPS  TFoSE 
C  HENT. 

//OThiIHAS  JCE  t  1972, f*  «..Tn 
//bPAIN  CRO«NPLVMi.l972F 
II  EAEC  FLRUbti 
7/FGKT.SYSlN  4.0  <* 

INTEGER**  SSA  l,3S».4i.SSAi«i, 
*  TaFnC  .mYcCU  •  ?4*0  4MA4. 1  Cel  -  . ;  lb  fut 
A  REG  v  T*FF3 t  HY  ECS  t7vj  .PaS.icPb.ci 
RE  S  •  T  AFFL  1 HYcLL  .PC*  1  PS*  ,  Oct'1  • 
c  TSrtL  .CHS  YL,«  El 
1*9  oLANA  ,14  iric 
LANK/'  '/ ,n INC/ *  -9*/ 
'it SJ77 St  10 


NHiMt  4.«ch  Ths  T  iHN  bF  ENLIST 
ASs*o 


eT  SYL  ti 
IN  I E  lici 
LATA  EL 
00  aOO  1 


£  f  iiuiwHittjt 

AYAtETStAStLnSYAs 


S  SAN  1 

*«  <JLA^ 

SSAN2 

•  ULmNA 

SSA.A3 

•  NiftC 

TaFnC 

•  -9 

HYECu 

«  -9 

»Gl 

»  -9 

NSC 

«  -9 

OEPO 

m  -9 

ETSYC 

■  -9 

£  TSNC. 

•  -9 

ChSVC 

■  -9 

REU 

*  -9 

TAFNS 

«  -9 

HYECS 

-  -9 

5§! 

ETSNS 

ChSVS 

RES 

TAFNL 

«£Ct 

Si  pl 

ETSYL 

Itsml 

ChSYL 

REL 


-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 

-9 


300 


REACH. 100!  SSAM.SSANE 
OePU.cTSYu.ETSHC.CnSvO, 
TAFHS.HYECS.Pbi  .NSi  .ucP 
TAFNL  .MYECL. POL  .nSL  ,b£P 

4.RITEU,2001SSAM,SSAi42 

Oc  PC*  ,fc  TSYu,  c*  ,cn S*u  , 

TAFH3,nVSC$»PG$  tn3&  ,w£P 
TAFttL  »HY£Ci  ,  POL  ,** L  ,*£»> 
CNT  1M£ 
loP 


t£iANJ,T  iPNU  thYcUoiPCwtHSoi 
rfkw* 

SkIoYo!  I 
LtiTStc, 


t  cTSrt  StCnSvS#  ac.S* 

!  cT^rtc  foniVLi  acl 


Af  fV*  *H7  iCvo#T  »U*rtSu» 
s  *cf sr$» t 

LtcIitL, 


i  cT>^i»CrtSrfii^|S» 
>  £  tcniWi.  ,  Ktw 


100  F  G RM  AT  1 3 A3  *  1  <,2X  ,  1  3  ,  o  <v ,  12  , 

•  1  lAt  I3fUil2iU(  UtU  •  1 2  , 

*6Xti  2  1  lo  fl?X  tu  1 


290  FORMAT  UA3tlA, 

vjjtlXf 12i 1X| i 2i  IX  ,12, IX* l 
•I  2.1X,  U,lA,l3t  IX  ,12,  IX,  1 
-*'4 


IX  , i 2,  1a*  i 


1 $ » IX ,11 1 141 1 7X , 12, 12, *A  v lit  11, 
1  iX  ,  1  2,1  2i*»a  *11,11,1  1a,  13, 
till 

2 , 1A, 12 r IX, l 2, IX, 

2 , i A  «  12, 1a, 

2 , 1a  ,  13,  1 A •  1  2  •  1a  •  12  ,  lx  , 

2 , 1a  , 


/» 


ENO 


«5£R«*a0  2d2,  C  I<jP*(wkO)  , 

AllAfe-UU-tl  • 


//GC.FTC1F  001  DC  *ilT»3230v  ,Vut. 

//  0Cfc«l A£CF«*F£,  Ut£C C*32 6,dL 
/✓  0SKAM6*PSi. 51*72.0779 

//GO •FT  Q2F  001  uO  0*4 1 T *  j 350,  vCc»S  £k«HV iO 04 ,0  1  $P«  ( 5HK  1  .» 

//  SPACEMCYL  )  •  uCd*  IA£GPP«FS,L!<  fcC.*.«9i,QLftii2E*l9Jup  1  , 

//  5 S NAM £•« 1972.1 7? A 

// 
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i  ?*ii  9  «<><+*  A*  *  A*  A*,*  A  (ol  Hi  fy  ww)<v  -  ‘  I 

//,)  l»V*  •  „v>fc  4  l*  U  *  >  i**t  I  ♦  '  Jtlrtl^A^1  »,W  *iJ«9 
//•»*(  *  v*M»wV«i  .  ly /*  f 
//  t  U  C  *  w  *  1  > v  w 
S/»CM  dVS }*  JO  • 

I  (  u«v*  ^  *4  M  i4  •  i  *  A.ij»  A  **  1 1  *Tc  »  p  *c  *  f  •*  *t  i  •&  1  r*  $ 

•  *  i,  T*'**>»A*wt<«jf  fe'*T>  i  *hI  •  »  i  •  •*»  •  i  ■<•  i  vAw  it  a«*4«!  t 

V  tut  i  •  I  **AUS  j 

lNTt4t4*»«»  9k*>«k  i^lnt 
*ATA  OtA.«A/T  V,W*«/*  -*•/ 

J C  j«JO 


SSMl 

AUtS 
H  tfcw 
iE« 

A  ACl 

*  tTH 

»Lr  rt 

^rst 

■  T 

to  A  IV 

•  Al  V  ki 

t  *A*jT 

hi 


•  Iti'W 

•  dv  *<>a 

•  ni  <1 

•  -* 

*  ••# 

•  -« 

•  *v 

•  -<♦ 

•  -t 

•  -v 

•  -f 

•  -* 

•  — 1 

•  -♦ 

•  -* 

•  -¥ 

•  -f 

•  -V 

•  -f 


Af  ACU,lQO»SSA*W  >S*N*  •  l!4«J».Mttt  .rife*  »&£a.aAwc  •*Tis, 
C'4l  )  I  *H  I  *  •!  (RAiV  *  «Ai  •  At  |(AaM«T  • 


TO  I  »  i  P\»  tAU4«U»  •  i  AHui  4 


*AlTl(*,2JJ|£SAKl|i  iAu  «!SA0^4ttc»^TtC  iHa  iK<twii(tnt 

•  M  f  H  j  »Af  (*tf«i  E'lti  I  iH  i  ♦  •  f  *  a  A  i  V  »  *  A  i  V  At  »  t  AAH^t  $ 

•  Tut  .  tPi»dOHUS,f  »«j»4 

CLNT iKti 

f  0^  AT  IjO.JJx.  I2d*»  12,  llilliUi  li,l2f+fttU'*JA, 

•  {  1  •  1  i  »  i  J  »  4  &  *  »  i  2  i 

sji:|i:jl:ti:i!:li:ii:  ti:!i:ti:t::t*:!i:ti: 

MiiUf 

thU 

•  MOIPOOJ  00  JH  II«  J  330V  .  *Jt«  *•»,  0  lit'-iut.  0)  > 

if  OOl  Ju  *J<*  i  t  •  i  3  3  J  *  S  I*  ■*%•»}  0%  (IS  1  f  l«l*  t  fctt  A  I  • 

iHLi  •  IUl  il^i^M  i  Oi.  ••  lKct.F»«Aa>t«  iC  L'MilWi  it  *i  tj-ta  I  • 
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c 

c 

//.'I 

//>**• 
i  / 

//*  V. 


rn-Hki  Jit 
A  4  S 
n  '  t  C 

*T  .  S'  *  IN  w!  • 

>  >am  ,  *,  v  ; . .; 
•  Cr<M,av»*tn’lMi'\  i- 
•*  /  »M  *  ,  A  P  1  ■_  .  *  P  T  J  ,  A  '  .  t  A  ►* 

«  AP  f  «,.*.*!  N#  A  P|  v,  ,  A*-1  1*>  ,  Jf 

|  '4  T*  cl  A>4A  t  ^  |  N  t 

w  **  I  a  »h  A  Ka.  /  *  « /  .S  »  St /  1 

OL  10  li  i*k,ij77d.lJ 


U  »  *  f  .  a  f  ,  i  •'  ,  A  r  r  ..  , 
’  [o.M  P  I  i  ,  T  . 


Be  A  ‘4* 
t>C  mN* 
Nl  4t 

-s 

-9 

be  4 Nil 

Si  N 
-v 
-9 
-S 

•  9 
-9 
-9 
-9 
~  9 
-9 
-9 
-9 
-9 
-9 
-9 
-9 
~9 
•9 
-9 
“9 
-9 


300 

XOO 

200 


✓* 

//GC 

// 

// 

//i»0 

// 


*}an  1 

s  i  tN  ? 

A  >an  3 

C  tNJ 
2  iPl 
2  1P2 

ri\j* 

A  U1C 

T  F 

Af  WT  P 
APIA 
APIS 
APTC 
APT3 

AP]r 
APT? 

A  P  T  „ 

:.  p  T  h 
APT  1 
APT  J 
APT* 

A  P 1 1. 

APTS 
Af  T.W 

a  f  :  j 

APTP 
AFECS 

ft E  AC  (  1 ,  1  00  I  SSAM  ,S  SA92 
A  C  t  N*  .C  t  SO*  <1  I  P  1  •  i  l  Pc.  *niM  j 
»  Btl.iC.If  .AFwTP,  APT  A,  API  fi, 

»  AP  TC  #AP  T  J.APT  C  .  Af'lf  , 

»  APJu.APTH.APT  I  ,  APTj  .APT  A. 

•  APT  4.  .APTK.APTN.  APTu.APTf 

**1TH2.2C0I  a  S  as  l .  SjAN*.jjAM. 

•  Lt  SB  ]bt  NO  »  L  i  p  l  *  l  4  <r'i  .  HuK  . 

-  h?  c  iC  .T  f  .Aft,  TP  .  APT-,  fAP  T  8  , 

•  AP  K  fAPllifAPT  £  •  APTf  , 

•  aPTu.aPIm.aP! I.APTj.aPTk. 
p  AP  U  .APT *, APTS, APTu  f  AP  T  P  .  AP££S 

CCM  INC* 

S  IoP 

PC PS  AT (  3  A  3  .  (  2  .  >  lfAi.A2.I2vl2A.il. *  a  » l 2.12.  1a.  1 2  i 
U.l  2.  I2fiA.li.12  .PA*  1  2  »  i  4  »12»l«.«'.OAf  i4l 
f OHM  AT  I  3  A  3  •  1  A  .  / 

4fiA.12flA.2Ai*  l*.  I  2  i  1 A  .  U  ,  U ,12  .  4  A ,  1 2  *  1  A  . 
&.lA.li«iAtl2flA.l«:.lA,i2*lAtl^,^Af 
«,lA,i4,lA,i2ilA.i&.  lA,l<,iA.l2,  1  A  «  i  «  ,  1  A  ,  i«.  t  U  i 
2,  lA.  12,1  A,  121 
CNO 

•  PTOIPCCI  DC  IT  •  3  3  30V  .  vU  l  *  S  c  0  *  2  32  ♦  1 1  .>  P*  (  Ol  0 1  . 

OCb«(ktCFH"fo»L*tCi*  J<.4#dcAii2t:*l«.  /  in  , 

t)ir>  AMc-P^S.il-*  72  .  WF  T  79 

•  rlCtPCCl  JU  0T4I  I  *  j  i  ?0.  »lL«  Si  a  *Hv  jO  OS  ,u  1  Sp- <  SmK  I  • 
aPAC^"ICTL.49.*.1  I  t  uu*  l  hfc  ur-  P*  F  b  .C-t  cll*  o  j  .  d  La  j  1  2  e  •  1  9  J'.O  *  « 


I  2. 12  ,  U 


//  JSNAHE-S1972.V.79C 
// 


,  12,12 
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AEmm  2 

APL  rBOGIlUS  FOB  DATA  aAHIPOLATIC* 


rc  o  3? 

n  % t* i » ^ 

••*«©**•  A#*ic,n 

•**«rc*r»***ic  n* 
►  t  C  »  33 

»«o  «.»»«**  vC  ,43 

c> «ve*>»«fT It ,43 

•  |  73 
*•(«»»•**  +  jf if  3 
••t  «€*►••»▼  t  C  If) 

«  *r»**MC ,jo3 

•  i  ii] 
t tn» r • l C I i 2  3 

t^c  133 

,43 

t.»r»*r?3C 

» 


rori cQ3» 

t  •«copt  r*r*,e,p,»ccMr»re*,3tiT* 

C 1 3  ?*«  COCW»N  TO  OK  okccoko 

C23  C*Q 

C3 J  0* *  !«**•?  tM«  BiAtn  to  oc  oteooKO- 

C*3  t*C 

C33  0*.*  THtOtT  TM«  HU«OKO  TO  OK  •KCOPCt*  *0  . 

t*3  HCOOt^O 

C73  t|TOA*.®4TA 

C  3  3  ***•[  |C3#-i*4T*C»C3*» 

t?3  ootoc  |C3*.o*toc  |C3«Hcoot 

C 103  MATA^IATA 

C 1 1 3  »e3*0b*TAc  ic) ♦?*▼©*[  ic)* (ooataj |C3*mcopc > 

C 123  mmt^»b4ta 

C  1  3  3  OKCOPKO  OOTO  It  NO«  A  OkOPAk  VAftAOkt  C*CLtt  »Kt>*T« 


C  1  3 
L  2  Z 
t  33 

c  4 : 
C53 
C43 
c*3 
181 
C?3 
tic: 
ui: 
C  12  3 
C  1  3  3 
C  1  4  3 
C153 
C  1*3 
C  1 7  3 
Cl  9  3 
11?  3 
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i 

/\ 


v  »»**<  if  i  tr:iiiv»u-' 

C  1  2  •  *«•>*  c*«ooim  *  tuiirt  ir  *t> 

C23  *r*if ▼**»  «*t*n  o*  ?m*»  iumit 
£*2  Q»  1  COCUMM  HUNM«  QW  rut  IIlIC^IOh  v:CTOi  ' 

C  4  2  c  ue 
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APBESDIX  H 

BOXPLOT  AH1ITSI5  CP  BEHAIHIHG  VABIAELZS 

Bczplots  cf  the  remaining  candidate  explanatory 
variables  versus  length  of  service  are  provided  in 
this  appendix.  Befer  to  Chapter  3,  pages  62 
through  66  for  discussion  on  each  of  these 
bczplots.  Bemaining  candidate  explanatory  vari¬ 
ables  displayed  in  this  appendix  are  listed  in 
Table  X7I 


TABLE  XVI 

Candidate  Explanatory  Variables 


flental  Category 

Marital  Status 

Age 

Sex 

Bace 

Beenlistsent  Code 
Character  of  Service 


Figure  H.2  Hental  Category  vs.  length  of  Service, II 
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Figure  H.9  Hental  Category  vs.  Length  of  Service# IX 
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Figure  H.  10  Marital  status  vs.  Service 
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figure  H. 11  Harital  Status  »s-  Ser*ice#II 
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figure  B. 12  Marital  Status  vs.  Service, III 
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figure  H. 13  Marital  Status  vs.  Service, IV 
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figure  H. 14  Marital  Status  vs.  Service 


figure  H. 15  Barital  Status  vs.  Service, vi 


Figure  H.16  Age  ts(  Service 
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Figure  H.  17  Age  vs.  Service, II 


pigore  H.20  Age  vs.  Service 
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Figure  H.23  lge  is.  Seriice#FIlI 


Figure  H.24  ige  vs.  Service, IX 
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Figure  B.26  Sex  vs.  Service#!! 
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Figure  H.  27  Sex  vs.  Service, III 


figure  H.29  Sex  vs.  Service 
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Figure  H. 31  Bace  vs.  Service 


Figure  H.  32  fiace  vs.  Service, II 
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Figure  8.33  3ace  vs.  Service, HI 
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Figure  H.34  Eace  vs.  Service, IV 


A"»NQ  OSHH* 


8  9 
•  *- 
1  < 
3  3 

i  2 

f  o 

*  Ui 


4s 

T 

Jl  - 

<  f*  *■  u» 

8li  S 

*•*►**  T 


y 

i  S  5 


*»  « 


o -ooo  o 


•*  «n«ai 

S'rgNJ*' 

♦  **  w 


SS-S 

iOO»n  « 


f»S 

S8SSS 

8*«*» 
w»  rf»o 

2?mo*S 
§  '■"!!- 


3a00  J.N3WtSnN33a 


I  ; 

s * 

.15  ; 

v  <  «  r 

u*  >*  w  T 


*  ff 

V  i  *! 

§  5  * 


C  l  "  l 

3000  lN3KUSnN33a 


o«ooo»o< 


ff  ffff •» 

~  ~  ••  ©•— 

*  - «  »  «  m 

X  «•»* * 


2 

♦  ♦ff 

r«  **  f*  ♦2«S8S 

ff 

8SSS8888S 
*00000000 


Figure  H.38  Beenlistnent  Code  vs.  Education  level, II 
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Figure  H.40  Beenlistient  Coie  vs.  Service,!! 


Figure  H.<t2  fieenlistaent  Co4e  vs.  Service, IV 


figure  B.43  Beenlistsent  Code  vs.  Service 


Figure  H.44  Beenlistment  Code  vs.  Service, VI 


Figure  B.45  Character  of  Service  vs.  Education  Level, I 
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Figure  H.47  Character  of  Service  vs.  Education  Level, III 
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Figure  B.  51  Character  of  Service  vs.  Service, IV 
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Figure  H.  52  Character  of  Service  vs.  Service 
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Pigure  H.  53  Character  of  Service  vs.  Service, VI 


Figure  fl.5«  Character  of  Service  vs.  Service,  VII 
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SURVIVOR  COHVES  FOE  3  TEAR  ENLISTEES 

This  Appendix  contains  survivor  curves  for  FY79  3 
year- obligated  enlistee?  for  the  six  candidate 
explanatory  variables  listed  in.  Table  XVII  belcv. 
Tabular  suaaaries  of  the  analysis  and  discussion  of 
the  analysis  is  provided  in  Chapter  4  of  this 
thesis. 


TABLE  XVII 

Candidate  Explanatory  Variables 


Figure 


Figure 
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Figure  K*  4  Barital  Status/  Vaaber  of  Dependents 


Figure  K.7  Hilitary  Occupational  Skill 


APPENDIX  L 

SUBVIVOB  COBVES  FOB  4  I BAB  ENLISTEES 

This  Appendix  contains  survivor  curves  for  FY79  4 
year-obligated  enlistees  for  the  six  candidate 
explanatory  variables  listed  in  Table  X7II  below. 
Tabular  suaaaries  cf  the  analysis  and  discussion  of 
the  analysis  is  provided  in  Chapter  4  of  this 
thesis. 


TABLE  XVIII 

Candidate  Explanatory  variables 


Sex 

Bace 

Hental  Category 

Narital  Star us/ Number  of  Dependents 
Hilitary  Occupational  Skill 
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Figure  1,3 
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